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The invention relates to the extraction of information, relating to a collection of data entities 
or resources having relations between them. Entities are objects, such as documents, 
users, books, movies, words, relational tables, etc., about which a user would like to extract 
some information. 

An affinity may be thought of broadly as a similarity measure between the two 
entities. For instance, if the two entities are two text documents, then one possible 
affinity is the number of words one document has in common with the other. 
One of the sets of entities consists of human subjects. For instance, for entity sets of persons 
and movies, an affinity might be a quantitative measurement of how well a person likes one of 
the movies. One possible such affinity is the familiar one-to four-star rating scheme, 
affinity values are combined to form a similarity value for each respective one of the entities, 
a similarity value for the entity and for each respective other one of the entities in the 
collection 

The similarity between two entities is a measure of how similar they are in terms of their 
affinity relationships. 

The idea is that we wish to rank the entities by these significances, which in most cases are 
designed to correlate strongly with subjective qualities like goodness or desirability. 
Significance values for a given entity are indexed according to their corresponding 
similarity measures, (the level/degree of similarity) 

The goal of the invention is to elicit interesting structure from a collection of entities or 
resources with explicit and/or implicit, static and/or dynamic relations between them. 
Interesting structure includes (1) notions of quality or authority, for instance when seeking 
definitive sources of information, (2) notions of relevance to the user's information need, (3) 
notions of similarity among the plurality of resources retrieved from a universe of 
resources by a query process, and (4) notions of similarity among the usages of resources 
by different users/servers (often with the purpose of grouping similar users so that they can 
gain from resources that other users have explored, a process called "collaborative 
filtering" 
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AREA EXTRACTING METHOD 



PUB. NO. : 
PUBLISHED: 
INVENTOR (s) : 

APPLICANT (s) 

APPL. NO.: 
FILED: 
INTL CLASS: 
JAPIO CLASS: 
JOURNAL : 



04-278683 [JP 4278683 A] 
October 05, 1992 (19921005) 
EGAWA KOI CHI 
YOSHITAKE TOSHIYUKI 

FUJITSU LTD [000522] (A Japanese Company or Corporation), JP 
(Japan) 

03-041624 [JP 9141624] 
March 07, 1991 (19910307) 
[5] G06F-015/70 

4 5.4 (INFORMATION PROCESSING -- Computer Applications) 
Section: P, Section No. 1487, Vol. 17, No. 76, Pg. 14, 
February 16, 1993 (19930216) 



ABSTRACT 

PURPOSE: To extract the area of a subject picture element in 
consideration of the subject picture element and the attribute value of 
a picture element peripheral to the subject picture element . 
CONSTITUTION: The attribute value of a picture element of an original 

image and the attribute value of other picture elements (8 picture 

elements in a diagram) adjacent to the above picture element are shown 
in the contrasts set with use of the dispersion of the density value 

secured in a part consisting of 9 picture elements and the differential 
statistic value showing the distribution of the differences of density 
between two picture elements . An intermediate image is obtained with 
those attribute values defined as the attribute value of a single 
picture element . Then the attribute value of each picture element of 
such intermediate image includes the pattern information on a small part 
consisting of a subject picture element of an original image and the 
picture elements adjacent to the subject picture element. Thus it is 
possible to extract an area based on the difference of patterns included in 
the image and through the extraction of the intermediate image. Furthermore 
the data form shows the data on each picture element in regard of the 
extraction of the intermediate image . Therefore this methods is highly 

affinitive with various conventional methods so that the existing 
software property can be effectively used. 
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WPI Acc No: 2003-380420/200336 

XRPX Acc No: N03-303823 

List generating device automatically generates query for list of people 
based on terms acquired from received content that is associated with one 
task, so that list of people is generated based on response to query 

Patent Assignee: KESKAR D V (KESK-I); SANVITALE D (SANV-I) 

Inventor: KESKAR D V; SANVITALE D * 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030028524 Al 20030206 US 2001919351 A 20010731 200336 B 

Priority Applications (No Type Date) : US 2001919351 A 20010731 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030028524 Al 10 G06F-007/00 

Abstract (Basic) : US 20030028524 Al 

NOVELTY - A query for a list of people is generated based on the 



terms acquired from the received content that is associated with a 
task, so that a list of people is generated based on the response to 
the query. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) a recorded medium stored with program for generating a list of 
people; 

(2) a computer system for generating list of people; and 

(3) a list of people generating method. 

USE - For generating list of people relevant to task. 

ADVANTAGE - Enables user to provide a feedback regarding the level 
of interest with respect to document. Increases the efficiency of a 
task process by providing a list of people relevant to the task and 
content. Eliminates the need to search through a large address book. 
Provides correlation across different tasks. Enhances degree of 
closeness between individuals in a working group . 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the computer system. 

pp; 10 DwgNo 1/4 
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Derwent Class: T01 

International Patent Class (Main) : G06F-007/00 
File Segment: EPI 
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WPI Acc No: 2002-425004/200245 

XRPX Acc No: N02-334146 

Online live auction platform provision method in distributed virtual 
network, involves receiving response from owner of item for request 
generated by member of group having similar characteristics and 
interests 

Patent Assignee: ABRAMS H A (ABRA-I ) ; BARNES M C (BARN- I) ; CHENG G S H 
(CHEN-I); GRABER G F (GRAB- I ) ; LINDO J B (LIND-I); WHITE P R (WHIT-I) 

Inventor: ABRAMS H A; BARNES M C; CHENG G S H; GRABER G F; LINDO J B; WHITE 
P R 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20020032634 Al 20020314 US 2000659215 A 20000911 200245 B 

US 2000752585 A 20001227 

Priority Applications (No Type Date): US 2000752585 A 20001227; US 

2000659215 A 20000911 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 20020032634 Al 15 G06F-017/60 CIP of application US 2000659215 

Abstract (Basic) : US 20020032634 Al 

NOVELTY - A list of items being auctioned among group of 
members having similar information sources, interests, characteristics 
and pre-existing relationships is accessed. A request including one of 
the items and bidding price is generated from a member. The member 
receives a response from the owner of one of the items, corresponding 
to generated request. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
online auction platform providing system. 

USE - For providing online live auction platform in distributed 
virtual network such as Internet, wireless networks such as GSM, CDMA, 
TDMA, PHS wireless networks applied to any auction environment, where 
communities of members are merged using relationships with other 
members . 



ADVANTAGE - Members of the linked communities are in control of the 
rules and conditions governing interactions in the linked auction 
communities. Community information can be segregated into public and 
non-public storage areas with item level control of the information. 
Auction items have high affinity with the background of the 
possible bidders so that auction items are more appreciated. 
Emphasizes more on person -to- person interaction in the online 
actions and hence eliminates need for middle broker. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
networked communication system for implementing online auction platform 
providing method. 

pp; 15 DwgNo 1/8 
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International Patent Class (Main) : G06F-017/60 
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WPI Acc No: 2002-011322/200201 

XRPX Acc No: N02-009372 

Computer network implemented method for collaborative individual bench 
marking determining bench mark value for person based on bench mark 
data and ranking them against stored values for others 

Patent Assignee: NETGUILDS INC (NETG-N) 

Inventor: FARKAS B; KESSENICH K 0; SEIFMAN D 

Number of Countries: 095 Number of Patents: 002 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

WO 200182171 Al 20011101 WO 2001US11730 A 20010425 200201 B 
AU 200157007 A 20011107 AU 200157007 A 20010425 200219 

Priority Applications (No Type Date) : US 2000199388 P 20000425 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

WO 200182171 Al E 37 G06F-017/60 

Designated States (National) : AE AG AL AM AT AU AZ BA BB BG BR 
CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID 
JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ 
PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA 
Designated States (Regional) : AT BE CH CY DE DK EA ES FI FR GB 
IE IT KE LS LU MC MW MZ NL OA PT SD SE SL SZ TR TZ UG ZW 

AU 200157007 A G06F-017/60 Based on patent WO 200182171 

Abstract (Basic) : WO 200182171 Al 

NOVELTY - The method involves receiving a person's personal bench 
mark data for a bench mark at a bench marking computer. A bench mark 
value is determined for the person based upon the bench mark data. 
The bench mark value is ranked for the person relative to stored 
values for other persons for the bench mark. An indicator is provided 
based at least in part upon the ranking. 

The method further involves transmitting the bench mark data from a 
client computer to the bench marking computer 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for a 
computer network system, for a computer implemented method of building 
a data base of personal affinity group information and for a computer 
implemented data base of personal affinity group information. 

USE - For personal management . 

ADVANTAGE - Provides a person information about their relative 
personal performance in their personal affinity group . Allows user 

to compare usefulness of specific products and services directed to 
improving person 1 s relative personal performance. 
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DESCRIPTION OF DRAWING (S) - The figure shows a network of digital 
computers including computers hosting a novel bench marking computer 
system. 

pp; 37 DwgNo 1/8 
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Derwent Class: T01 
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File Segment: EPI 



15/5/5 (Item 4 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2003 Thomson Derwent. All rts. reserv. 



014098119 **Image available** 

WPI Acc No: 2001-582333/200165 

XRPX Acc No: N01-433832 

Benchmarking based determination system of best practices comprises 
benchmarking industry affinity group members against other 
comparable members by gathering data to calculate indicators to compare 

Patent Assignee: NETGUILDS INC (NETG-N) 

Inventor: CHIAT J; FARKAS B; SEIFMAN D H 

Number of Countries: 095 Number of Patents: 003 

Patent Family: 

Applicat No Kind Date Week 
WO 2001US4948 A 20010307 200165 B 
AU 200145281 A 20010307 200204 
EP 2001918175 A 20010307 200310 
WO 2001US4948 A 20010307 



Patent No 
WO 200167342 
AU 200145281 
EP 1281141 



Kind Date 
Al 20010913 
A 20010917 
Al 20030205 



Priority Applications (No Type Date) : US 2000215076 P 20000630; US 

2000187703 P 20000307; US 2000556787 A 20000425 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

WO 200167342 Al E 61 G06F-017/60 

Designated States (National) : AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA 
CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP 
KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT 
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Designated States (Regional) : AT BE CH CY DE DK EA ES FI FR GB GH GM GR 
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AU 200145281 A G06F-017/60 Based on patent WO 200167342 

EP 1281141 Al E G06F-017/60 Based on patent WO 200167342 

Designated States (Regional) : AL AT BE CH CY DE DK ES FI FR GB GR IE IT 
LI LT LU LV MC MK NL PT RO SE SI TR 



Abstract (Basic) : WO 200167342 Al 

NOVELTY - The method comprises of system gathering data and 
calculating indicators specific to each member based on collected 
data and comparing values to provide a database of company (5) 
information and employee status. This also provides means to 
automatically identify, offer and sell improved products and services 
based on benchmark (3) comparisons. Also provides results database 
information to subscribed or affiliated members (4) for market or 
self -assessment 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
system used to implement the method. Also for the construction and 
generation of the database containing data relevant to the invention 
and also for the computer program product used by the computer 
implemented system also included. Also included is the process for 
determining best practices 

USE - Used to provide a collaborative bench mark based 
determination system to generate a database of relevant comparative 
information 

ADVANTAGE - Provides information about relative performance 
information to industry affinity groups while providing means for 



building a database of benchmarked company and employee information for 
the purpose of improving service and product usefulness and performance 
and best practices used in affinity groups by correlation and 
comparison 

DESCRIPTION OF DRAWING (S) - The drawing shows a schematic of a 
network of digital computers including hosting a benchmarking system 
Benchmark system (3) 

Users, subscribers, affiliated members (4) 

Company (5) 
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Computer implemented method for providing recommendations, involves 
determining affinity between users by analyzing partitioned 
preference data associated with stored data with respect to item 

Patent Assignee: NET PERCEPTIONS INC (NETP-N) 

Inventor: BIEGANSKI P; DRISKILL R; FRANKOWSKI D; GURALNIK V; MULIER F 
Number of Countries: 093 Number of Patents: 002 
Patent Family: 
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2000177213 P 20000121 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

WO 200153973 A2 E 46 G06F-017/00 
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Abstract (Basic) : WO 200153973 A2 

NOVELTY - The method involves determining affinity between two 
users by analyzing partitioned preference data associated with stored 
data that reflects positive and negative preferences expressed by each 
one of a set of users with respect to an item. A recommendation is 
provided based on determined affinity. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) System for providing recommendation; 

(b) Computer readable medium 

USE - For providing recommendations used in e-commerce business 
activities . 

ADVANTAGE - Allows e-commerce operators to take advantage of 
customer databases to provide valuable personalized service to 
customers . Provides a recommendation server with software capable of 
using rating space partitioned (RSP) data recommendations to user. 

DESCRIPTION OF DRAWING (S) - The figure shows the data processing 
system suitable for practicing methods and systems consistent with the 
recommendation providing system. 
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Information structuring method for visualization of data sets containing 
large number of objects by ranking based on strength, clustering 
related objects and computing number of affinity charts per object 

Patent Assignee: NAPSTER INC (NAPS-N) ; GIGABEAT INC (GIGA-N) 

Inventor: JANNINK J F 

Number of Countries: 093 Number of Patents: 002 
Patent Family: 
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Abstract (Basic): WO 200135271 A2 

NOVELTY - An affinity value is calculated between related and 
selected objects to rank and order objects in the data set . A 
continuous curve of spiral segments is employed to connect items at 
different intensity levels, the results are presented and adjusted 
using an affinity chart with color, curve thickness and shading 
gradation selectively employed to emphasize curve's role in conveying 
affinity strength while placing related items . 

DETAILED DESCRIPTION - INDEPENDENT claims are also included for a 
system for providing visualization of items from data sets , a 
computer program stored on a computer readable medium, a chart server 
for creating a graphical layout of a visualization of an interrelated 
data set. 

USE - For visualization and hyper linking of data sets containing 
large number of objects e.g. relational database, object database or 
XML document . 

ADVANTAGE - It enhances clarity of the visualization to avoid 
information overlap and overload with items grouped by strength of 
affinity . 

DESCRIPTION OF DRAWING (S) - The figure shows flow chart 
illustrating method of visualizing large interrelated data sets, 
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Abstract (Basic): WO 200060507 A2 

NOVELTY - A computer-based method comprising defining a set of 
constraints on possible target surfaces, and defining a fully 
enumerated set of theoretical target surfaces under the defined 
constraints, so that each surface has a defined, continuous volume and 
a defined, continuous surface area, is new. 

DETAILED DESCRIPTION - A computer-based method comprising defining 
a set of constraints on possible target surfaces, and defining a fully 
enumerated set of theoretical target surfaces under the defined 
constraints, so that each surface has a defined, continuous volume and 
a defined, continuous surface area, is new. The method further 
comprises mapping sets of objects to the fully enumerated set of 
theoretical target surfaces to define corresponding subsets of the 
fully enumerated set of theoretical target surfaces, and analyzing an 
aspect of diversity of the objects based on degrees of similarities 
and differences among the corresponding subsets. 

An INDEPENDENT CLAIM is also included for a computer programmed to 
determine the chemical similarity of different molecules, the program 
comprising : 

(a) approximating the surface shape of molecules of interest by 
linking a series of cubes, each having a dimension R, the locations of 
the cubes being determined by the calculated electron probability 
density of the individual molecule of interest, each cube sharing at 
least one of its six faces with another cube, so that there is a 
specific number of linked cubes which varies for each molecule of 
interest ; 

(b) approximating the chemical reactivity of each individual 
molecule of interest by assigning each cube of each individual 
molecule of interest, no more than one functionality value from M 



different chemical functionality values; 

(c) approximating the surface shape and chemical reactivity of a 
chemically active surface having a volume equal to V by subtracting a 
number V/R3 cubes of dimension R from a surface, where each cube space 
shares at least one face with another cube space and where N cube 
spaces have one of M different chemical functionality values; 

(d) calculating an attraction value K for each molecule of interest 
to the chemically active surface; and 

(e) calculating a list of overall attraction values to the 
chemically active surface. 

USE - For analyzing molecule and protein diversity, 
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ABSTRACT 

PURPOSE: To refine data for similarity discrimination by considering 
noticed attributes as elements , multiplying predetermined weighting 

values by respective importance degrees and adding multiplied results 
to the attribute sets of the elements for the respective noticed 
attributes forming pairs with the importance degrees of a value equal to 
or more than a predetermined value . 

CONSTITUTION: For the data 1 for the similarity discrimination, the data 
for the similarity discrimination before refining stored in a data base are 
expressed in a chart form and the attribute sets 3 which are the sets of 
the pairs of the attributes and the importance degrees are made 
correspond to the elements 2. The attributes for which the importance 
degree of the attribute set of the element A is equal to or more than 
0.5, that are the attribute B and the attribute C, are defined as the 
noticed attributes 4 . The attribute B and the attribute C which are the 
noticed attributes are respectively considered as the elements and the 
corresponding attribute sets are searched from the data for the 
similarity discrimination. The importance degrees are multiplied by 0.8 
which is a specified value 5 and 0.5 which is the value 6 and added to 
the attribute set of the original element A, the attribute set of the 
element A is newly defined. A similar operation is repeated for the 
elements other than the element A and storage in the data base is 
performed. 
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ABSTRACT 

PURPOSE: To enable retrieval with similarity between documents in the 
case of document retrieval by calculating the similarity and difference 



mutually between documents corresponding to the attribute set of the 
document . 

CONSTITUTION: A document attribute value condition input means 6 inputs 
the different point or equal point of a document presented at present by a 
user. A document attribute type condition input means 7 designates the 
retrieval corresponding to the type of an attribute to be conscious of as 
the different or equal point. The user designates the document presented at 
present as the document provided with the close attribute by using a 
document adjacent request input means 8. Corresponding to these retrieval 
conditions, the document is retrieved by a similarity /difference 
retrieving means 11 while using the information calculated by a similarity 
calculating means 9 and a difference calculating means 10. 
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ABSTRACT 

PURPOSE: To attain accurate and speedy recognition even when the contrast 
of an input image is low by extracting brightness information on a scanning 
straight line specified in the input image, and performing the processing 
for improving its image quality, and extracting the start point and end 
point of a pattern from the change point of the improved brightness 
information . 



CONSTITUTION: An image 20 which includes a mark 19 on a circuit board 18 to 
be recognized is inputted from a TV camera 17 which is an image pickup 
device. The hatched part of the image 20 is a high-brightness area and 21 
indicates a noise pattern. The scanning straight line 23 is specified in 
the image 20 and the brightness information on the scanning straight line 
is extracted. The image quality improvement processing such as noise 
removal and image emphasis is carried out for the extracted brightness 
information and the start point and end point of the pattern are extracted 
from the change point of the improved brightness information to obtain a 
group of candidates for the image pattern corresponding to the 
object to be recognized. The similarity between the pattern widths and 
center positions of those candidate patterns and the widths and positions 
of the object to be recognized is used as a criterion to identify the 
image pattern corresponding to the object to be recognized. The 
position is measured to measure the position of the object to be measured. 
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Database evaluation system for helping consumers and business users to 

find required items in database of computers 
Patent Assignee: BIZRATECOM (BIZR-N) 
Inventor: SCHMITT M 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 5983220 A 19991109 US 956812 P 19951115 199954 B 

US 96748944 A 19961114 
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US 5983220 A 54 G06F-017/30 Provisional application US 956812 

Abstract (Basic) : US 5983220 A 

NOVELTY - A proximity searcher user interface is coupled to 
evaluation engine for displaying reference item from the database 
(2.4). The searcher user interface also displays nearest neighbor item 
for attribute as a function of distance between reference item and 
nearest neighbor item, for at least one attribute of domain model 
(2.10) . 

DETAILED DESCRIPTION - An evaluation engine couples domain model to 
the database, and provides a user interface (2.16) for allowing user 
to iteratively set criterion for selecting and displaying a set of 
matching items comprising a short list. The evaluation engine allows 
user to inspect, compare or navigate the items on short list. A scoring 
interface displays relative score of each item from short list. A 
direct manipulator performs weighting of relative weight of 
attribute of item . The evaluation engine redetermines relative 
score of each item in short list according to any change in relative 
weighting of attributes. 

USE - For helping consumers and business users to find items in 
computer database that most closely matches their objective 
requirements and subjective preferences in network environment. 

ADVANTAGE - Supports analysis and evaluation of similarity of 
items in database with respect to multiple criteria, hence database of 
information rich items can be turned into an interactive buyer's guide. 

DESCRIPTION OF DRAWING (S) - The figure shows software component of 
database evaluation system. 

Database (2.4) 

Domain model (2.10) 

User interface (2.16) 
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internet 
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Abstract (Basic) : US 5926812 A 

NOVELTY - A set of document extract entries are extracted from 
corresponding document set . Each entry comprises a weighted word 
histogram for corresponding document . A set of word clusters 
comprising cluster word list, total distance matrix and connection 
matrix are generated from the entries. A similarity degree is 
determined between the word clusters. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for a 
method for determining relevance of document contents. 

USE - For searching hypertext documents on internet. 

ADVANTAGE - Provides fast and accurate searching to identify 
documents of interest to particular user or users without any need for 
the user or users to specify the search criteria. Actively and 
automatically alerts the user of local information related to 
present work. 

DESCRIPTION OF DRAWING (S) - The figure shows schematic 
representation of data structure of total distance matrix and 
connection matrix. 
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Abstract (Basic) : US 5893092 A 

NOVELTY - The selected text is grouped and are ranked according to 
relevancy. Based on a manual determination of relevancy, a feed back 
information is applied to create a different query, automatically to 
form a second rank list . 

DETAILED DESCRIPTION - A sentence, phrase or semantic unit of a 
text in a document is selected from a database collection by a 
natural language search query. The second rank list is of a different 
ranking order. The procedure of ranking the second group is the same as 



that of the first group. 

USE - In personal computers for searching internal files, for modem 
search systems. Applies to retrieve and filter documents such as 
patents, legal documents, medical documents, articles, journals as per 
search request. For answering questions from general information 
database of public affairs office. 

ADVANTAGE - The reading time is minimized and the user is allowed 
to make relevant decisions very easy by just indicating by a key 
stroke whether a document is relative or not. The sentences saves 
the user time by forcing the user to discover small units which are 
relevant or not relevant and enhances quality of search. There is no 
size limit for the number of documents to be searched. Relevancy 
feedback helps the user to automatically identify alternative words 
useful for expressing a query. Provides an automated retrieval system 
which minimizes reading efforts of the user and also minimizes the 
need for highlighting relevant words on a screenful of text. 

DESCRIPTION OF DRAWING (S) - The figure is a flow chart for 
determining the number to indicate the relevance or similarity of a 
document to a query. 
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Abstract (Basic): WO 9749046 A 

The method is used for retrieving information from a collection 



of items each having a corresponding item identifier and each 
being associated by a scalar value with an attribute having a 
corresponding attribute identifier. At least one of the attributes is 
also associated with another of the items in the collection . A data 
processor presents the collection as a map (210) of tuples of item 
identifiers, attribute identifiers, and scalar values . 

The method involves forming a query vector (200) having pairs of 
attribute identifiers and scalar values. The query vector and the map 
is composed to produce a result vector (220) having pairs of item 
identifiers and corresponding scalar values . The latter represent 
the relationship of the query vector and the map for the items having 
the corresponding item values . 

USE - For analysing collection of data items to reveal 
associative structures within collection of data items . 

ADVANTAGE - Allows for calculating distance and/or similarity 
measures to serve as input to statistical techniques. 
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Abstract (Basic) : US 5436830 A 

The process for using a computer to elicit, organize, and link 
different forms of data to automatically generate and present a 
consensus map, involves establishing a series of baseline images in a 
first file in computer memory relating to a research topic, sorting the 
baseline images into groups by a consumer , each of which have 
similar qualities, eliciting and storing sensory and emotional 
perceptions from the consumer regarding the research topic and 
eliciting and storing constructs from the consumer using an interview 
procedure . 

Baseline and additional images are elicited and stored from the 
consumer that correlates closest with the research topic, and 
opposite images from the consumer that represent ideas opposite ideas 
represented by the research topic are also elicited and stored. A 
graphical representation of relationships among the stored images 
and constructs is automatically generated. A consensus map is 
automatically derived from a number of the graphical representations, 
each based on images and constructs of a different one of a number of 
consumers, and the consensus map is visually presented. 

USE /ADVANTAGE - Eliciting from customer important aspects 



associated with particular topic about which marketing program is to be 
devised . 
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Abstract (Basic) : EP 544430 A 

The method for determining frequency of words involves segmenting 
the document image into image units without decoding the image 
content. At least one significant morphological image 
characteristic of selected image units are determined in the document 
image. Equivalence classes of the selected units are identified by 
clustering the selected image units with similar morphological 
image characteristics. 

The image units are quantified in each equivalence class. In the 
identification step, image unit morphological image characteristics 
are correlated using a decision network. 

ADVANTAGE - Frequency determined by solely using visual 
characteristics and without reliance on lexical reference. 
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ABSTRACT 

PURPOSE: To obtain an appropriate clustering result when plural pairs 
having maximum affinities exist by updating the affinities by treating 
the item contained in the most numerous pairs or all of the same 
conditional pairs containing clusters as one cluster. 

CONSTITUTION: Where or not input data are a measurement matrix is 
discriminated and, when they are a measurement matrix, the data are 
converted into inter-item affinities and each item is treated as one 
cluster . Then the number of the clusters is checked and, when clusters 
exist more than two, all of the maximum affinity pairs among clusters are 
extracted. Thereafter, the most numerously contained item or cluster is 
extracted. According to the extracted result, all of the pairs having the 
maximum affinities including the items or clusters , namely, all of 
the pairs having the same condition are treated as one cluster. 
Successively, distance updating is repeated until the number of clusters 
reduced to one. 
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conditions with transmission data, based on which output is generated by 
warning unit 
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Abstract (Basic) : JP 10302006 A 

The apparatus has a transmission data setting unit (11) for setting 
data regarding individual information and desired conditions. The 
set data is transmitted via a data transmission circuit (14). 

A receiving circuit (15) receives data regarding individual 
information and desired conditions, that is transmitted from other 



unit. The received data is compared with the transmission data by a 
judgment unit (18) and a warning unit (19) provides output based on 
judgment result. 

ADVANTAGE - Simplifies selection of companion. 
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Abstract (Basic) : WO 9642172 A 

The method for enabling a user to fetch a specific information 
item from a set of such items in an information processing system 
involves forming an image of a user affinity between the user and the 
information items from a pattern of past interactions between the user 
and the system. The user image is recorded in a user attribute. A 
specific information item is selected by the first agent in further 
dependence on the user attribute. 

Preferably, a set of agents are used within the system, one of 
which is activated in accordance with the user attribute. The user 
affinity image is formed on the basis of the user's cognisance of the 
specific information item. 

USE/ADVANTAGE - System in which information items are presented as 
objects in space and agent guides user through space to propose 
specific objects to user. Adapted to instantaneous needs and 
circumstances of customer. 
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Abstract (Basic) : EP 547990 A 

The group identification method may encompass all users within 
the data processing system, a selected subset of users, or a single 
selected user and his designated affinity users or proxies. 

The group identification is listed within an associated access list 
for a particular object. Upon an attempted access of the particular 
object by a user not listed explicitly within the associated access 
list, whether or not that user is listed within a group 
identification which is permitted access is determined. If the user 
is listed in the group identification, access is permitted. 

ADVANTAGE - Allows affinity users of authorized accessors to 
access an object. Provides improved system of access control. 
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When a surrogate user accesses resources in a distributed system, 
they are entered as members of a user set . The user sets can 
contain other surrogate users or other user sets . The user is 
queried for alocation to a simple user list or surrogate user list. 

In an attempt by a surrogate user to access a resource, the access 
mechanism identifies them by checking the user set and creating a 
user list containing key information and the level of access to be 
granted . 

USE /ADVANTAGE - Allows surrogate users to have audit trail 
maintained and improved access security implemented. ( 6pp Dwg.No.1/1 
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Claims 
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English Abstract 

A powerful, scalable, and reconf igurable image processing system and 
method for processing stereo image data is described. A general purpose, 
reconf igurable engine with toroidal topology, distributed memory, and 
wide bandwidth I/O is described for solving real applications at 
real-time speeds. The reconf igurable image processing system can be 
optimized to efficiently perform specialized computations, such as 
real-time video and audio processing. This reconf igurable image 
processing system provides high performance via high computational 
density, high memory bandwidth, and high I/O bandwidth. Generally, the 
reconf igurable image processing system and its control structure include 
a homogeneous array of 16 field programmable gate arrays (FPGA) and 16 
static random access memories (SRAM) arranged in a partial torus 
configuration (Fig. 46). The reconf igurable image processing system also 
includes a PCI bus interface chip, a clock control chip, and a datapath 
chip. It can be implemented in a single board. It receives data from its 
external environment, computes correspondence, and uses the results of 
the correspondence computations for various post-processing, industrial 
applications. The reconf igurable image processing system determines 
correspondence by using non-parametric local transforms followed by 
correlation. These non-parametric local transforms include the census, 
and rank transforms. Other embodiments involve a combination of 
correspondence, rectification, a left-right consistency check, and the 
application of an interest operator. 

French Abstract 

L f invention concerne un systeme de traitement des images puissant, a 
echelle variable et de type reconf igurable, et un procede de traitement 
des donnees . La machine reconf igurable, de type polyvalent, a topologie 
toroidale, a memoire repartie et a grande largeur de bande en 
entrees/sorties, permet le traitement des applications reelles aux 
vitesses du temps reel. Le systeme reconf igurable de traitement des 
images peut etre optimise efficacement pour les besoins de calculs 
specialises (par exemple, traitement video et audio en temps reel) . Ce 



systeme a un haut rendement grace a sa densite de calcul ainsi que sa 
largeur de bande elevee en memoire et en entrees/sorties. Generalement , 
le systeme et sa structure de commande ont un ensemble homogene de 16 
reseaux de portes programmables par 1 ' utilisateur (ou circuits FPGA) et 
de 16 memoires RAM statiques (SRAM) , en configuration toroidale 
partielle. En outre, le systeme a une puce d 1 interface de bus 
d 1 interconnexion de peripheriques (PCI), une puce de commande d'horloge 
et une puce de trajet de donnees. La mise en oeuvre est possible sur 
carte unique. Le systeme recoit les donnees de 1 1 environnement externe, 
calcule les correspondances et utilise les resultats des calculs de 
correspondance pour differentes applications industrielles de 
post-traitement . Enfin, le systeme determine les correspondances en 
utilisant des transformees locales non parametriques , cette operation 
etant suivie par une correlation. Ces tranformees comprennent les 
transformees de denombrement et de rang. D f autres variantes font 
intervenir en combinaison la correspondance, la rectification, le 
controle d ' homogeneite de gauche a droite et 1 1 application d f un operateur 
de bonification dans 1 ' interet de 1 ' utilisateur . 

...International Patent Class: G06F-017/15 
Fulltext Availability: 
Detailed Description 
Detailed Description 
... illustrative embodiment. 

I . Hamming distances. 

In the preferred embodiment, Hamming distances are used to correlate 
pixels in the reference image with pixels in the other image . The 
Hamming distance of two bit strings is the number of bit positions that 
differ in these two bit strings. 

Correspondence of two pixels can... 

...representative census transformed values will be small. 

Pixels P and Q represent two transformed pixels, where P is a census 
transformed pixel for one input image and Q is a census transformed 
pixel in a search window W(P) for a second input image. The Hamming 
distance between the two transf ormed . . . the window sum for window 334). 
Thus, the window sum for reference element 341 may be calculated 
44 

based on the window sum for reference element 340, by sliding the 
window, adding new values and subtracting old values. 
FIGS. 9 (A) -9(C) illustrate in summary fashion one embodiment of the 
present invention. Again, these figures ignore boundary conditions... 
obtain the correlation data therein for the 'Initial calculation. Each 
next reference image element involves moving over D columns from the 
location of the previous image element . 

Step 722 sets the incrementing variable INCR to 0. This value will be 
used to check for all disparities from D-l to 0 until all correlation sum 
data for a given 44 reference" left image. . . 
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English Abstract 

Disclosed is a method of determining receptor-ligand affinities by 
constructing a receptor protein model, placing a ligand into the binding 
pocket portion of the receptor protein model, calculating the 
protein-ligand interaction energies and then predicting the ligand 
binding affinity for the receptor protein from a mathematical equation. 
This method provides for a way of determining receptor binding affinities 
of potential ligand molecules without the need of actually preparing 
these molecules in a laboratory and testing them by in vitro receptor 
binding assays. Such determinations are useful for identifying potential 
ligands of receptors for use in studying receptor binding, in studying 
receptor activities, as potential modulators of receptor activity, and as 
actual or lead compunds useful as therapeutics which modulate receptor 
activity. 

French Abstract 

Cette invention se rapporte a un procede d' evaluation des affinites 
recepteur-ligand, qui consiste a fabriquer un modele de proteine 
receptrice, a placer un ligand a 1 ' interieur de la poche de liaison du 
modele de la proteine receptrice, a calculer les energies d ' interaction 
proteine-ligand, puis a prevoir l'af finite de liaison du ligand pour la 
proteine receptrice au moyen d'une equation mathematique . Ce procede 
const itue un moyen permettant d'evaluer les affinites de liaison a un 
recepteur des molecules d'un ligand potentiel sans avoir besoin de 
preparer reellement ces molecules dans un laboratoire ni de les tester 
par des methodes in vitro d 1 analyse de liaison a un recepteur. Ces 
evaluations s'averent utiles pour identifier des ligands potentiels de 
recepteurs permettant d'etudier la liaison au recepteur ou les activites 
du recepteur, et utilises en tant que modulateurs de l'activite du 
recepteur et en tant qu 1 agents therapeutiques qui modulent l'activite du 
recepteur . 

Main International Patent Class: G06F-017/50 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... binding proteins in such a family. For this purpose, specific figands 
have been discovered which have a higher affinity for one binding protein 
in a family than for other members . The, disclosed method can be used 
to 

identify potential ligands which are selective for one or a subset of 

members of a binding protein family . Preferably, this is accomplished 
by 

building and refining models of the binding pockets for relevant 
members 

of a binding protein family , and generating predictive equations for 
each model. The affinity of a potential figand molecule can then be 
calculated 

for each binding protein and the affinities compared. Selective ligand 
molecules can be identified as those having a desired pattern of 
affinities 

for the binding protein family members . For example, a ligand might 

be 
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sought having a high affinity for one of the binding proteins but not for 
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English Abstract 

In analyzing a collection of data items to determine data structures, 
the collection of data items is treated as a two-dimensional map. A query 
vector with elements of interest is composed with the map to form a 
result vector. A profile vector formed from the matrix is combined with 
the result vector to form a discrimination vector representing the degree 
of expectation that the elements of the query vector have related to the 
map . 

French Abstract 

Dans 1' analyse d'une collecte de donnees elementaires afin de determiner 
des structures de donnees, la collecte de donnees elementaires est 
traitee en tant que topographie bidimensionnelle . Un vecteur 
d ' interrogation comportant des elements d ! interet est associe a la 
topographie afin de creer un vecteur de resultat. Un vecteur de profil 
cree a partir de la matrice est combine au vecteur de resultat afin 
d'obtenir un vecteur de discrimination representant le degre d'attente 
des elements du vecteur d ' interrogation par rapport a la topographie. 

Main International Patent Class: G06F-017/30 

Fulltext Availability: 
Detailed Description 

Detailed Description 

... In terms of retrieval, the set of tenn vectors may be considered to 
be all of the topics upon which the map focuses, and the set of 
document vectors to be the set of documents that are about each of 
these topics. 

6. Relevance feedback 

A ftifther operation that the invention can perfonn is relevance feedback 
where eross-correlations between elements in a set are discovered to 
enhance queries . . . 
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English Abstract 

A user of a system comprising a large set of information items, for 
example a multimedia database, is assisted by an agent in searching the 
set. The agent has a given affinity for the information items and selects 
a specific information item from the set in conformity with said 
affinity. On the basis of the interactions between the user and the 
system, an image is formed of the affinity of the user for the 
information items. The agent utilizes this image, in addition to its own 
affinity, for the selection of a specific information item. A major 
application of the invention concerns a system in which the information 
items are presented as objects in a space and in which the agent guides 
the user through the space and proposes a specific object to the user. 

French Abstract 

Un utilisateur d'un systeme comprenant un ensemble important d 1 elements 
d ' information, par exemple une base de donnees multimedia, est aide dans 
1 ' exploration de 1' ensemble par un agent. Ce dernier, qui est dote d ! une 
af finite donnee pour les elements d 1 information, selectionne un element 
specif ique d 1 information dans l f ensemble en fonction de ladite af finite. 
A partir des interactions survenant entre 1 ' utilisateur et le systeme, il 
est forme une image de l'af finite de 1 ' utilisateur pour les elements 
d' information. L'agent utilise cette image, venant s'ajouter a la sienne, 
pour selectionner un element specif ique d ' information . Une application de 
premier plan de 1' invention consiste en un systeme dans lequel les 
elements d ' information sont presentes comme des objets dans un espace et 
dans lequel l'agent guide 1 ' utilisateur dans ledit espace et lui propose 
un objet specif ique. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... To this end, the method in accordance with the invention is 
2 

characterized in that it comprises the step of forming at least once an 
image of a user affinity between the user and the information 
items from a pattern of past interactions between the user and the system 
and recording this image in a user attribute, and. . . 

...of the specific information item by the first agent in further F 
dependence on the user attribute. As a result of the formation of an 
image of the user affinity between the user and the information 



items and the consideration of this user affinity for the selection 
of an information item , the item which best suits the user at the 
relevant instant will be selected from a possibly large group of 
information items . The method in accordance with the invention can be 
used notably to assist the user in searching large sets of 
information 

items such as they occur, for example in multimedia databases 

A version of the method of the invention in which the system comprises a 
set of ... sub-sets in the system can be made. Upon selection of the 
information item, two types of criteria are used. First of all, the 
information item should suit the relevant agent, said instantaneous 
internal state being used to record a degree of affinity between the 
information items and the agent. This type of criterion can be 
implemented as a simple table for the agent, weighting factors being 
included for the various types of attributes of the information items . 
For each relevant information item a score is then determined on the 
basis of the table and the value of the attributes of the information 
item . Secondly, the information item should suit the user. To this 
end, in addition to the attributes of the information item, the 
parameters constituting said user 
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English Abstract 

A method and apparatus for organizing information in a computer filing 
system. The method and apparatus include the creation of a pile (23) 
comprising a collection of documents and displaying a graphical 
representation of the collection of documents. The method and apparatus 
further include browsing the collection of documents (53) by pointing a 
cursor (21) at a particular item in the collection of documents (53) to 
reveal an indicia (50) for the particular item in the collection of 
documents. The filing system can automatically devide a pile (23) (e.g. a 
collection of documents from an electronic mail network) into subpiles on 
the basis of the content of each document in the pile, and the filing 
system, at the user's request can automatically file away documents into 
existing piles in the computer system on the basis of a similarity match 



between the content (or other internal representation) of the document 
and the content (or other internal representation) of existing piles in 
the. computer system. The filing system can also create a pile (23) from a 
sample document by using the internal representation of the document as 
the internal representation of the new pile. The computer filing system 
provides various interfaces in connection with piles to the user of the 
system to provide feedback and other information to the user, including 
information concerning the documents and piles in the computer's filing 
system. 

French Abstract 

L 1 invention concerne un procede et un appareil d ' organisation des 
informations dans un systeme de classement inf ormatique . Le procede et 
cet appareil permettent la creation d'une pile (23) regroupant un 
ensemble de documents et la representation graphique dudit ensemble de 
documents. Lesdits procede et appareil permettent egalement de parcourir 
l 1 ensemble des documents (53) en designant au moyen d'un curseur (21) un 
article particulier dans 1* ensemble de documents (53), afin de produire 
une marque d 1 identification (50) pour 1' article particulier de 1' ensemble 
de documents. Le systeme de classement peut diviser automat iquement une 
pile (23) (par exemple, un ensemble de documents dans un reseau de 
courrier electronique ) en sous-piles selon le contenu de chaque document 
dans ladite pile, et peut, sur demande de 1 ' utilisateur, classer des 
documents dans des piles existant dans le systeme inf ormatique, par 
comparaison du contenu du document (ou d'une autre representation 
interne) au contenu (ou a une autre representation interne) des piles 
existant dans le systeme inf ormatique . Ledit systeme de classement peut 
egalement creer une pile (23) avec un document d'essai en utilisant la 
representation interne dudit document en tant que representation interne 
de la nouvelle pile. Le systeme de classement inf ormatique off re a 
1 ' utilisateur du systeme diverses interfaces avec les piles, ce qui 
permet de lui fournir un retour de 1 ' inf ormation et des informations dont 
certaines concernant les documents et les piles dans le systeme de 
classement inf ormatique . 

Main International Patent Class: G06F-015/62 
Fulltext Availability: 
Detailed Description 

Detailed Description 

. . . which embodies the collective contents of thg 

pile. The vector between a document and a pile may be compared for the 
purpose of determining the relatedness / similarity of the document 
to the pile for purposes of filing or other operations described as part 
of this invention. 
0 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows . . . 
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English Abstract 

A text comprehension and retrieval method and apparatus that uses 
semantic analysis (124) of the syntax between letters in two words to 
measure to what degree two words are related. Semantic analysis involves 
assigning numerical values to the letters of a first word and a second 
word based on the dual characteristics of orientation and category 
inherent in the letters, and then analyzing those numerical values to 
identify the semantic relationship of the letters of the first word to 
the letters of the second word. A semantic-matrix (200) assigns weights 
to meaningful letters to allow the application of letter semantic rules 
to convert the concepts represented by the letters of the words to 
numeric values (128) . The numeric values represent how much the first 
word is related to the second word and are used to retrieve text (132) 
from documents having concepts related to user supplied query expression. 



French Abstract 

Procede et appareil de recherche et de comprehension de texte, utilisant 
une analyse semantique (124) de la syntaxe entre des lettres dans deux 
mots pour mesurer le degre de relation entre les deux mots. L 1 analyse 
semantique implique 1 ' affectation de valeurs numeriques a des lettres 
d'un premier mot et d'un second mot en se basant sur les doubles 
caracteristiques d ' orientation et de categorie inherentes des lettres, 
puis 1* analyse de ces valeurs numeriques pour identifier la relation 
semantique des lettres du premier mot avec les lettres du second mot. Une 
matrice semantique (200) affecte des coefficients de ponderation a des 
lettres ayant un sens pour permettre 1 ' application des regies de 
semantique des lettres et convertir les concepts representes par les 
lettres des mots en valeurs numeriques (128). Les valeurs numeriques 
representent le degre de relation entre le premier mot et le second mot 
et elles sont utilisees pour la recherche de texte (132) a partir de 
documents ayant des concepts relatifs a une expression d ' interrogation 
introduite par 1 ' utilisateur . 

Main International Patent Class: G06F-015/38 
Fulltext Availability: 
Detailed Description 

Detailed Description 

... of each block to the words of the query 

expression. CPU 14 then retrieves or marks for retrieval those blocks 
having a letter-semantic relatedness value that equals or exceeds the 
relatedness threshold set by the user . 

The letter-semantic relatedness value is a measure of how 

closely the concepts or meaning presented by the words of the text docu 

ment relate to the concepts or meanings... 

9 
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A method for recommending items to users using automated collaborative 
filtering stores profiles of users relating ratings to items in 
memory. Profiles of items are also stored in memory, the item profiles 



associating users with the rating given to the item by that user . 
Similarity factors with respect to other users are calculated for a 
user , and these similarity factors are used to select a set of 
neighboring users . The neighboring users are weighted based on 
their respective similarity factors, and a rating for an item contained 
in the domain is predicted. In one embodiment, items in the domain have 
features. In this embodiment, the values for features can be clustered, 
and the similarity factors incorporate assigned feature weights and 
feature value cluster weights. 
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English Abstract 

The present invention relates to a method of generating an index (2) to 
a semi-structured database (1). Semi-structured databases contain a 
number of items, each of which is stored as a set of semi-structured data 
including a number of related entries. The presence of these entries are 
determined by comparing the sets of data to a number of selection 
criteria, defining one or more predetermined characteristics of various 
entries . A set of indices is then generated representing a concordance 
between the determined entries and the respective items. 

French Abstract 

La presente invention concerne un procede de production d'un index (2) 
pour une base de donnees semi-structuree (1). Les bases de donnees 
semi-structurees contiennent un certain nombre d' articles dont chacun est 
memorise sous la forme d'un ensemble de donnes semi-structurees 
comprenant un certain nombre d' entrees connexes. La presence de ces 



entrees est determinee en comparant les ensembles de donnees a un certain 
nombre de criteres de selection, definissant une ou plusieurs 
caracteristiques predeterminees d' entrees variees. Un ensemble d* index 
est ensuite produit representant une concordance entre les entrees 
determinees et les articles respectifs. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Claims 

Claim 

. . . been identified as part of the telephone 

number field 4 6. However, the aim is not to produce a 
single rule that will work for all items , but to produce a 

set of rules, each of which will be represented in the 
respective selection criterion, such that when the 

selection criterion is applied to the data, the... and Goods indices are 
derived 

from the free text field. Thus, in the abovementioned 
example, the word "golf" would be determined and placed in 
the Goods slot. Similarly , words relating to payment 
methods, such as "Visa", "Cash" or "Credit Card" would be 
stored in the Payment slot, whilst opening hours are stored 
in the Opening . . . 

...is derived, it may also be advantageous to compare the 
Goods slot search term against the Free Text indices. 
However, as this may locate less relevant records , then the 
system can be configured to perform this search only if 
insufficient records are initially found. 
As far as the a Name index, a... 

. . . from the 

database. Optionally these could also be searched. 

once the indices have been defined, it is preferable 

to further define a set of ranking values indicating how 

relevant an item is to a particular index. This is 
achieved by determining the number of items that would be 
located using one specific index. In general, for the 
majority of indices, if a large number of items would be 
obtained, then each item has a relatively low ranking value 
indicating a relatively low relevance . In contrast, if 
only a small number of items are obtained for a particular 
index, these will have a high ranking value indicating that 
they are very relevant items . 
The situation is further complicated by heading 
entries as each heading entry will refer to a number of 

related items , all of which are relevant . Accordingly, 
indices for heading entries are given a higher ranking 
value than those for the text entries. 
In the case of see-reference entries, the... 

...using the world model 106 if necessary. The 

query constructor then accesses the index store 2 in the 
backend 107 to obtain the location of relevant items within 
the database store 1. Once located, the relevant items are 
transferred back to the dialogue manager 103 which 
determines whether the retrieved items are acceptable. 

Acceptable items are passed on to the processor 100... tree can be found 
to represent the sentence. 

If the result of the parsing process is unsuitable, 
this will have the effect of that no relevant records are 
found. In this situation the dialogue manager 103 will 
detect that the request was unsuitable and therefore 
provide feedback to the user to have... 

...information in the Yellow Pages@ such as ,the,,, "and", 
"address", or "phone number" and searching by these words 



and/or phrases would not help locate relevant records . 
These words and/or phrases are referred to as stop words 
and a record of these is also stored in the lexicon in the 
memory. . . slot 

and-filler request will usually include at least one slot 
that must be filled in. In the present example, it is 
impossible to locate relevant records unless a search term 
is present in the goods slot. Accordingly, if the slot 
filler 108 determines that the goods slot is empty, then it 
returns the request to the dialogue manager 103. The... 
.usual way. 

Ouery Constructor 105 

The query constructor 105 uses the slot-and-f iller 

request to access the backend 107 and determine a number of 

items which appear relevant to the slot-and-f iller request. 
Thus, the query constructor 105 will access indices 
containing the keywords entered in the associated field of 
the slot . . . 



.constructor 105 would access the appropriate indices in the 
index store 2 that include the keywords "plumber", 
"boiler", "Ipswich", and "Visa". 

A list of any relevant items and their respective 
locations within the database store 1 is then returned to 
the query constructor 105 and passed onto the dialogue 
manager 103, which. . . 
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English Abstract 

The present invention relates to a method, a system and a server for 
caching Internet information content. According to the invention, there 
is provided a set of geographically distributed cache servers generally 
serving different geographical regions, wherein Internet information 
derived in relation to the operation of one of said cache servers is 
distributed to essentially all of said servers. 



French Abstract 

L' invention concerne un procede, un systeme et un serveur permettant de 
gerer en antememoire le contenu d 1 informations de l f Internet. Selon 



1' invention, un ensemble de serveurs de gestion en antememoire est 
geographiquement reparti et dessert generalement differentes zones 
geographiques . Les informations Internet obtenues en relation avec 
1 ' exploitation de 1 ' un des serveurs de gestion en antememoire sont 
distribuees a sensiblement tous les serveurs. 

Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Claims 

Claim 

graphically distributed cache servers serving different 
geographical regions but having a common relation, such 
as the relation of serving a culturally and/or linguistically defined 
user group or area, is updated with essen 
tially the same information whenever one of said cache 
servers retrieves information due to an information 
request from an... that the auxili 

ary cache server then provides the same stored content as 
the bypassed serves, thus making sure that the cached 
information is still relevant to the end users in the 
region of the bypassed server. 

Although the description of the invention has been . 
made with respect to the Internet communication system, 
it is . . . shown in 

Fig 1 will now be described. When, for example, a service 
provider 110a in region A receives an information request 
from an end user 120 relating to information provided by 
a content provider 130e located in a region E, which in 
this case lies outside the area 200 {for example, the ... each 
cache server are accurate and up to date, hence providing 
reliable copies of the original sites. 
In the following figures, for ease of description, 

elements having similar functions as those described with 
25 reference to Fig 1 will be designated with the same 
numerals . 

A cache server arrangement forming part of a... 
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English Abstract 

A method for recommending items to users using automated collaborative 
filtering stores profiles of users relating ratings to items . in 
memory. Profiles of items may also be stored in memory,- the item profiles 
associating users with the rating given to the item by that user or 
inferred for the user by the system. The user profiles include 
additional information relating to the user or associated with the 
rating given to an item by the user. Profiles of those users are accessed 
and the ratings are used to calculate similarity factors with respect 
to other users . The similarity factors, sometimes in connection with 
confidence factors, are used to select a set of neighboring users . 
The neighboring users are w eighted based on their respective 
similarity factors, and a rating for an item contained in the domain is 
predicted. An object for providing isolated, hierarchical data storage 
can be used in a method for recommending an item to one of a plurality of 
users. The data object abstracts an associated physical memory element 
and provides an interface for storing data and retrieving data from the 
physical memory element. A system for enabling an information marketplace 
includes a central server which stores data in a memory element. The data 
may or may not be encrypted. Regardless of whether the data is encrypted 
the server may also store a table which associates data elements and 
nodes with an authorization value . If a node requests data for which 
the authorization value in the table gives the node authorization to 
access, the server transmits the data to the node. If the data is 
encrypted, the server may transmit the encrypted data or it may decrypt 
the data for the node before transmission. 



French Abstract 

L' invention concerne un procede servant a recommander des articles a des 
utilisateurs a l'aide de profils d 1 utilisateurs de magasins cooperatifs 
automatises, qui ont trait a des articles stockes dans une memoire. Des 
profils d 1 articles peuvent egalement etre stockes dans la memoire, les 
profils d' articles associant des utilisateurs a une cotation qu ' un 
utilisateur donne attribue a 1' article, ou a une cotation que le systeme 
attribue par deduction a 1 1 utilisateur . Les profils d ' utilisateur 
comportent des informations supplementaires concernant 1 1 utilisateur , ou 
des informations associees a la cotation attribuee par ce dernier a un 
article. Des profils d 1 utilisateurs sont recuperes et les cotations sont 
utilisees pour calculer des facteurs de similitude avec d f autres 
utilisateurs. Les facteurs de similitude, parfois lies a des facteurs de 
confiance, sont utilises pour selectionner un ensemble d ' utilisateurs 
voisins. Les utilisateurs voisins sont ponderes d'apres leurs facteurs de 
similitude respectifs en vue d'obtenir une prevision de cotation pour un 
article faisant partie du domaine considere. Un objet servant a fournir 
un stockage de donnees isolees, hierarchiques peut etre utilise dans un 
procede de recommandation d 1 article a un utilisateur donne. L 1 objet de 
donnees est associe a un element de memoire physique et fournit une 
interface pour stocker et recuperer des donnees de 1' element de memoire 
physique. Un systeme permettant d'activer un marche d ' informations 
comporte un serveur central stockant des donnees dans un element de 
memoire. Les donnees peuvent etre chif frees ou non chif frees; quelles 
qu'elles soient, le serveur peut egalement stocker un tableau associant 
des elements de donnees et des noeuds a une valeur d 1 autorisation . Si un 
noeud demande des donnees pour lesquelles la valeur d 1 autorisation du 



tableau accorde un acces, le serveur transmet les donnees au noeud. Si 

les donnees sont chiffrees, le serveur peut transmettre les donnees 

chif frees ou dechiffrer celles-ci pour le noeud avant de les transmettre. 



Main International Patent Class: G06F-017/30 
English Abstract 

A method for recommending items to users using automated collaborative 
filtering stores profiles of users relating ratings to items in 
memory. Profiles of items may also be stored in memory, the item profiles 
associating users with the rating given to the item by that user or 
inferred for the user by the system. The user profiles include 
additional information relating to the user or associated with the 
rating given to an item by the user. Profiles of those users are accessed 
and the ratings are used to calculate similarity factors with respect 
to other users . The similarity factors, sometimes in connection with 
confidence factors, are used to select a set of neighboring users . 
The neighboring users are weighted based on their respective 
similarity factors, and a rating for an item contained in the domain is 
predicted. An object for providing isolated, hierarchical data... 

...The data may or may not be encrypted. Regardless of whether the data is 
encrypted the server may also store a table which associates data 
elements and nodes with an authorization value . If a node requests 
data for which the authorization value in the table gives the node 
authorization to access, the server transmits the data to... 
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English Abstract 

A method for recommending items to users using automated collaborative 
filtering stores profiles of users relating ratings to items in 
memory. Profiles of items may also be stored in memory, the item profiles 
associating users with the rating given to the item by that user or 
inferred for the user by the system. The user profiles include 
additional information relating to the user or associated with the 



rating given to an item by the user . Similarity factors with 
respect to other users , and confidence factors associated with the 
similarity factors, are calculated for a user and these similarity 
factors, in connection with the confidence factors, are used to select a 
set of neighboring users . The neighboring users are weighted based 
on their respective similarity factors, and a rating for an item 
contained in the domain is predicted. In one embodiment, items in the 
domain have features. In this embodiment, the values for features can 
be clustered, and the similarity factors incorporate assigned feature 
weights and feature value cluster weights. 

French Abstract 

L 1 invention concerne un procede permettant de recommander des articles a 
des utilisateurs grace a un systeme automatise de filtrage cooperatif, 
qui enregistre dans sa memoire des profils d ' utilisateur, etablis sur la 
base des cotes que lesdits utilisateurs attribuent a des articles. On 
peut egalement stocker en memoire des profils d'articles, qui associent 
des utilisateur a la cote donnee a 1' article par 1 ' utilisateur en 
question ou a la cote que le systeme a deduit pour le compte de 
1 1 utilisateur . Les profils d ' utilisateur comprennent des informations 
supplementaires qui portent sur 1 ' utilisateur ou qui sont associees a la 
cote que 1 ' utilisateur a attribuee a un article donnee. On calcule, pour 
chaque utilisateur, des facteurs de similitude par rapport a d'autres 
utilisateurs, ainsi que des facteurs de vraisemblance associes auxdits 
facteurs de similitude, qui sont utilises pour selectionner un ensemble 
d ' utilisateur apparentes. On pondere ces utilisateurs apparentes en 
prenant en compte leurs facteurs de similitude respectifs, et on calcule 
une cote pour un article du domaine concerne. Selon un mode de 
realisation, les articles du domaine concerne sont connus par des 
caracteristiques . Selon ce mode de realisation, les valeurs correspondant 
a ces caracteristiques peuvent etre traitees en grappe, les facteurs de 
similitude integrant des ponderations de caracteristiques affectees et 
des ponderations en grappes des valeurs de caracteristiques. 

Main International Patent Class: G06F-017/60 

English Abstract 

A method for recommending items to users using automated collaborative 
filtering stores profiles of users relating ratings to items in 
memory. Profiles of items may also be stored in memory, the item profiles 
associating users with the rating given to the item by that user or 
inferred for the user by the system. The user profiles include 
additional information relating to the user or associated with the 
rating given to an item by the user . Similarity factors with 
respect to other users , and confidence factors associated with the 
similarity factors, are calculated for a user and these similarity 
factors, in connection with the confidence factors, are used to select a 
set of neighboring users . The neighboring users are weighted based 
on their respective similarity factors, and a rating for an item 
contained in the domain is predicted. In one embodiment, items in the 
domain have features. In this embodiment, the values for features can 
be clustered, and the similarity factors incorporate assigned feature 
weights and feature value cluster weights. 
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English Abstract 

A method of and apparatus for analysing thoughts, perceptions, knowledge 
feelings etc. of an individual. A set of elements are input or selected 
by a user. A single element and a pair of elements are formed. A user 
inputs similar characteristics between the pair of elements and 
difference characteristics between the single element and pair of 
elements. This is performed for a number of iterations and element and 
characteristic combinations. The elements are then ranked by a user in 
relation to each characteristic and the rankings are analysed to 
determine the correlation between elements and characteristics. The 
analysis may be expanded or refined and further elements and 
characteristics may be added at any stage. 

French Abstract 

L' invention porte sur une methode et un appareil d' analyse des pensees, 
perceptions et sentiments de connaissance d'un individu. L ' utilisateur 
introduit ou selectionne un ensemble d' elements. Un element seul ou une 
paire d 1 elements sont ensuite formes. L 1 utilisateur introduit des 
caracteristiques similaires entre la paire d' elements, et des 
caracteristiques differentes entre 1 ' element seul et la paire d' elements. 
L' operation s'effectue pour differentes combinaisons d 1 iterations , 
d 1 elements et de caracteristiques. Les elements sont ensuite classes par 
ordre par 1 ' utilisateur en fonction de chaque caracteristique et les 
ordres de classement sont analyses pour determiner la correlation entre 
les elements et les caracteristiques. L' analyse peut etre etendue ou 
raffinee et de nouveaux elements et caracteristiques peuvent etre ajoutes 
a tous les stades. 



Main International Patent Class: G06F-017/30 
Fulltext Availability: 
Claims 



Claim 

... or selected from a selection of stored qualifiers. 

1 0 10. An apparatus as claimed in claim 7 to 9 wherein the apparatus 
prompts a user to provide similar and dissimilar characteristics for 
a plurality of element groupings . 1 1 . An apparatus as claimed in any 
one of claims 7 to 1 0 wherein a user can select the elements to be 
compared. 1 5 1 2 . An apparatus as claimed in any one of claims 7 to 1 1 
wherein a user may reorder selected data elements into different 
groupings . 

1 3 . An apparatus as claimed in any one of claims 7 to 1 2 wherein the 
characteristics input by a user are stored by ... elements includes means 
responsive to user input for. defining a measurement range. 
1 7 . An apparatus as claim-ed in claim 1 6 wherein a user may enter a 

value within the measurement range for each characteristic in relation 

to each data element . 
1 8 . An apparatus as claimed in claim 1 7 wherein the apparatus forms a 
matrix having the data elements along one axis, the characteristics 



along the other and the values entered forming the matrix. 
1 9. An apparatus as claimed in claim 18 wherein the apparatus compares 
the values for each column of elements and/or each row of 
characteristics to find the closest correlation, forms a new column 
and/or row combining the most closely correlated rows and. . . 
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English Abstract 

An object-oriented programming framework for a case-based reasoning 
(CBR) system shell that permits a user build a case base having Case 
Structure Definitions (104) and Case Instance Definitions (106). The 
case-based reasoning system receives user queries (108) and produces a 
query solution that can be incorporated into the case base. The 
object-oriented framework includes a Session component that controls 
processing of the CBR system, a Control Flow component that manages the 
extension of the categories and classes of the object-oriented framework, 
a Data Store component that stores persistent case structure definitions, 
case instances, and a change log, a Presentation component that manages 
the user interface to the CBR system user, and a Query Engine that 
evaluates a received query against the case base. The case definitions 
and case base descriptions comprise a set of object-oriented classes that 
are organized into an inheritance hierarchy. 

French Abstract 

Un canevas de programmation oriente objet destine a une coquille de 
systeme de raisonnement base sur des cas qui permet a un utilisateur de 
construire une base de cas munie de definitions de structures de cas 
(104) et de definitions de cas (106). Le systeme de raisonnement base sur 
des cas recoit les interrogations de 1 ' utilisateur (108) et produit une 
solution a 1 ' interrogation qui peut etre incorporee a la base de cas. Le 
canevas oriente objet comprend une composante de session qui controle le 
traitement du systeme de raisonnement par cas, une composante de debit de 
controle qui gere 1' extension des categories et des classes du canevas 
oriente objet, une composante de memoire de donnees qui memorise les cas, 
les definitions de structures de cas qui se repetent, ainsi qu ' un journal 
des variations, une composante de presentation qui gere 1' interface 
utilisateur du systeme de raisonnement par cas, et une machine 
d 1 interrogation qui evalue 1 1 interrogation recue par rapport a la base de 
cas. Les definitions de cas et les descriptions de la base de cas 
comprennent un ensemble de classes orientees objet organisees en 
hierarchie par heritage. 
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Claims 



Claim 
... in 

a case definition. 

9 2 . A method as def ined in claim 72, wherein the 

provided framework permits a user to store the query object 

into the case set class, whereupon the stored query object can 

then be retrieved from the case set as the solution to a newly 

defined query object. 

9 3 . A method as def ined in. . . 

. . . supports a 

programming environment, the method comprising the steps of: 

providing a case-based reasoning system that 

operates in the programming environment; and 

evaluating a user query by determining a set of 

case instance descriptions that most . . .match properties 

of a user query and thereby produces a solution to the user 

query; wherein: 

the case instances comprise data structures that 

include properties, values , and attributes; 

the user query specifies a pattern of properties, 

values , and attributes, and is evaluated in a match scoring 
operation that compares the properties, values , and attributes 
of the user query with the corresponding properties, values , 
and attributes of a case instance and computes a match score 
indicating the similarity of the user query and the case 
instance ; and 

the match scoring operation comprises a dynamically 
weighted operation in which weight multiplier values are 
applied to designated properties of the user query and the 
case instance, wherein each weight multiplier value indicates 
an importance ranking of the designated property relative to 
the other properties of the respective user query and case 
instance . 

95 A method as. . . 
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English Abstract 

A computer-based information network for managing credit exposure 
between counterparties to a plurality of credit support agreements. The 
network comprises information storage and processing systems. The systems 
store various types of information including information representative 
of assets of counterparties to a plurality of credit support agreements 
for use in covering credit exposurres therebetween over a specified time 
period, and the plurality of credit support agreements. The systems 
process the information representative of the assets in order to 
effectively reflect a movement of certain of the assets to cover the 
credit exposures over the specified time period. An asset movement 
optimization process is used for determining an optimal movement of 
certain of said assets to cover credit exposures over the specified time 
period. 

French Abstract 

L' invention a trait a un reseau informatique s'articulant autour 
d'ordinateur et destine a gerer des risques de credit entre contreparties 
a plusieurs accords de soutien au credit . Ce reseau comporte des systemes 
de memorisation et de traitement de 1 ' information . Les systemes 
memorisent divers types d ' information dont des renseignements concernant 
des valeurs actives de contreparties a une pluralite d 1 accords de soutien 
au credit a utiliser pour couvrir entre eux des risques de credit courant 
sur une duree specif iee ainsi que les accords de soutien au credit. Les 
systemes traitent 1 1 information concernant les valeurs actives afin de 
rendre compte du mouvement de certaines de ces valeurs actives pour 
couvrir les risques de credit courant sur la duree specif iee. On met en 
oeuvre un processus d ' optimalisation de mouvement de valeur active pour 
determiner un mouvement optimal de certaines de ces valeurs actives pour 
couvrir des risques de credit sur la duree specif iee. 
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Fulltext Availability: 
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Claim 

... the user to select which agreement is to be used as base. Optionally, 
the subprocess allows tile user to pick another account within the same 
customer /account parent structure. This is similar to a file open 
dialogue box where the user can select a different disk from which to 
read files. The user must select which data ... support agreement (in 
memory); and the Event /Trigger is provided by user selection. In the 
illustrative embodiment, this subprocess employs some type of 
hierarchical data relationship between customer accounts and credit 
support agreements associated with the customer accounts. 

Subprocess A214 entitled MAINTAIN AGREEMENT ELIGIBILITY is a GUI process 
I 0 which provides a... the Output thereof is the server request to 
terminate the credit support agreement; I 0 and the Event /Trigger of this 
subprocess is provided by User selection. This subprocess is similar 
to the one that transmits agreement modifications to the other 
counterparty. Subprocess A520 entitled CANCEL EXISTING PLEDGES is a 
server-based process function which allows... 
...to return assets. This subprocess removes the pledge records and 
reverses the debits and credits that they produced. 

Process A600 entitled BROWSER- AGREEMENTS comprise a collection of GUT 
processes which allows customer to peruse several credit support 
agreements and view the details thereof with ease and flexibility. The 
Input to these processes is User ID and the. . .based on the market price 
of credit support assets on a particular day. The Input to this 



subprocess is the I 0 account to be valued and market prices; the 
Output is the current value of assets pledged to this account, and 
the Event/Trigger is provided by various triggers in server and the GUI. 
Prior to capturing credit exposure figures from. . .Move (calculated from 
Threshold, etc.), asset pieces available, Minimum Denomination Amount 

(i.e., if set to ZERO, then can be broken into any size (per asset ))@ 
Roundup type, the value of credit support collateral already 
transferred {after haircuts); Currency of credit exposure; Basic Amount 

(if any),- Independent Amount, Collateral available (including 
information specifying asset pieces ... which occurs during the problem 
solving stage), it will be helpful to keep several points in mind. The 
output of subprocess C360 is an information set constituting the Asset 

Movement Optimization Model of the illustrative embodiment. In general, 
this Optimization Model comprises two components, namely: (i) an 
objective function to be minimized or maximized. . . 

.is to find the amount of each asset type k that must be moved from 
counterparty i to counterparty j, such that the total market value of 
assets transferred among all pairs of counterparties (participating in 
the Optimization Process) is minimized, while satisfying the system of 
constraints represented by: the availability of assets... 

.convenience of computing the variables and the coefficients of these 
objective functions and constraints are arranged in the form of "sets of 
matrices". Collectively, these " sets of matrices" comprise the Asset 
Movement ...fall into either of two 1 5 categories: One-to-Many Transfer 
and Many-to-Many Transfer. One-to-Many Transfer subproblems consider 
transfers of assets from sets of counterparties who are obligated to 
provide assets to cover Delivery and/or Return Amounts but who are not 
due to received assets from other counterparties. Many-to-Many Transfer 
subproblems consider transfers of assets from sets of counterparties 
who both must provide assets to cover Deliver and/or Return Amounts and 
also are expected to receive assets from other counterparties. The all a 
e A. The flag Reuse, may take on the values Yes or No, indicating 
whether reuse of assets is permitted under agreement a r= A. Let A 
miDel. and A nifRet. denote the Delivery Amount and the Return Amount for 
the (directed) agreement... 

.any integer value in the range I,.- MarPfiority. However, covering even 

the lowest priority return is considered to be more 
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important than covering the highest priority delivery. 

Assets , Asset Positions, and Pledge Positions 
Let K represent the set of all assets within the accounts within 
GCSS. Let MP k denote the current market value of the minimum 
denomination of asset k 4=- K. Again, it is assumed that all values 
are in terms of a common currency, such as US Dollars or UK Pounds 
Sterling. The amount of a given asset which a particular counterparty. . . 
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Title: Mining fuzzy association rules 

Author: Chan, Keith C.C.; Au, Wai-Ho 

Corporate Source: Hong Kong Polytechnic Univ, Kowloon, Hong Kong 

Conference Title: Proceedings of the 1997 6th International Conference on 
Information and Knowledge Management, CIKM , 97 

Conference Location: Las Vegas, NV, USA Conference Date: 
19971110-19971114 

Sponsor: ACM 

E.I. Conference No.: 48553 

Source: International Conference on Information and Knowledge Management, 
Proceedings 1997. ACM, New York, NY, USA. p 209-215 
Publication Year: 1997 
CODEN: 002176 
Language: English 

Document Type: CA; (Conference Article) Treatment: T; (Theoretical) 
Journal Announcement: 9808W4 

Abstract: A technique called F-APACS which employs linguistic terms to 
represent the revealed regularities and exceptions for mining fuzzy 
association rules is presented. The linguistic representation is useful 
when the rules discovered are presented to human experts for examination. 
The definition of the linguistic terms is based on fuzzy set theory and the 
rules having these terms are called fuzzy association rules. The use of 
fuzzy techniques is considered as one of the key components of data 
mining systems because of the affinity with the human knowledge 
representation. 18 Refs. 

Descriptors: * Knowledge acquisition; Fuzzy sets; Knowledge representation 
; Computational linguistics; Algorithms; Database systems 

Identifiers: Fuzzy association rules; Data mining 

Classification Codes: 

723.4 (Artificial Intelligence); 721.1 (Computer Theory, Includes 
Formal Logic, Automata Theory, Switching Theory, Programming Theory); 723.3 
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Title: Data mining system using fuzzy rule induction 

Author: Maeda, Akira; Ashida, Hitoshi; Taniguchi, Yoji; Takahashi, Yori 
Corporate Source: Systems Development Lab, Hitachi Ltd, Kawasaki, Jpn 
Conference Title: Proceedings of the 1995 IEEE International Conference 
on Fuzzy Systems. Part 5 (of 5) 

Conference Location: Yokohama, Jpn Conference Date: 19950320-19950324 
Sponsor: IEEE 

E.I. Conference No.: 43461 

Source: International Joint Conference on the 4th IEEE International 
Conference on Fuzzy Systems and the 2nd International Fuzzy Engineering 
Symposium IEEE International Conference on Fuzzy Systems v 5 1995. IEEE, 
Piscataway, NJ, USA, 95CH3574 1 . p 45-46 

Publication Year: 1995 

CODEN: PIFSFZ 

Language : English 

Document Type: CA; (Conference Article) Treatment: A; (Applications); T 
; (Theoretical) 

Journal Announcement: 9510W4 

Abstract: Data mining is a technique used to extract nontrivial 
regularities or relationships as a piece of knowledge in databases. This 
technique can provide users with a very powerful tool for exploiting vast 



amount of stored data. This paper introduces a data mining system is 
introduced. The main feature of this system is specially design fuzzy rule 
induction algorithm which extracts useful pattern in databases. Since fuzzy 
logic has the affinity with the human knowledge representation, it 
considered as an essential component of data mining systems. 

Descriptors: *Fuzzy sets; Data processing; Database systems; Data storage 
equipment; Statistics; Data reduction; Knowledge based systems; Neural 
networks; Relational database systems; Backpropagation 
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921 (Applied Mathematics); 723 (Computer Software); 722 (Computer 
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Title: Discovering interesting statements from a database 

Author: Gebhardt, F. 

Corporate Source: Gesellschaft fur Mathematik und Datenverarbeitung mbH 
(GMD) , Sankt Augustin, Ger 

Source: Applied Stochastic Models and Data Analysis v 10 n 1 Mar 1994. p 
1-14 

Publication Year: 1994 

CODEN: ASMAEM ISSN: 8875-0024 

Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 9502W1 

Abstract: Knowledge discovery aims at extracting new knowledge from 
potentially large databases; this may be in the form of interesting 
statements about the data. Two interrelated classes of problem arise that 
are treated here: to put the subjective notion of 'interesting' into 
concrete terms and to deal with large numbers of statements that are 
related to one another (one rendering the other redundant or at least less 
interesting). Four increasingly subjective facets of ' interestingness 1 are 
identified: the subject field under consideration, the conspicuousness of a 
finding, its novelty, and its deviation from prior knowledge .^A procedure 
is proposed, and tried out on two quite different data sets, that allows 
for specifying interestingness by various means and that ranks the results 
in a way that take s interestingness, (relevance, evidence) as -well as mutual 
relatedness (sirniXarity^ affinity ) into account - manifestations of the 
second and third facets of interestingness in the given data environment 
(Author abstract) 14. 
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methods 
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Journal: IS Audit & Control Journal vol.5 p. 51 
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Publication Date: 1997 Country of Publication: USA 
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Abstract: Data mining is the automated analysis of detailed 

operational customer transaction data for the purposes of discovering 
hidden, unidentified or underlying patterns, trends and inter-relationships 
in order to understand, score or predict future customer, product or 
process behaviour. Some common yet powerful data mining applications 
spanning multiple industries include: market basket analysis or product 
affinity analysis; customer retention/vulnerability; the customer 

acquisition lifecycle; price optimization; risk management; and target 
marketing and segmentation. Proceed with caution if you intend to mine data 
which has been highly aggregated or too summarized. Your results will 
probably be too generalized and certainly average. You must ensure that you 
are mining a data set which has sufficient breadth or detail, and depth of 
history to produce meaningful results. Keep in mind that the absolute 
measure of value to be derived from any information environment lies in an 
organization's ability to leverage this source of consolidated customer 
information into actionable business decisions. Many leading-edge 
customer-focused organizations are indeed turning data mining into 
action to achieve new found growth via customer intimacy. (0 Refs) 
Subfile: C 

Descriptors: business data processing; data analysis; knowledge 
acquisition; marketing data processing; very large databases 

Identifiers: data mining ; operational customer transaction data 
analysis; underlying patterns; market basket analysis; product affinity- 
analysis; customer retention; customer vulnerability; customer acquisition 
lifecycle; price optimization; risk management; target marketing; market 
segmentation; aggregated data; summarized data; consolidated customer 
information; actionable business decisions; customer-focused organizations; 
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Seeking the right search tool — HEAD TO HEAD 

Johnson, Amy Helen 

Windows Magazine , December 1, 1997 , v8 nl2 pl51-153, 
ISSN: 060-1066 

Company Name: Alexa Internet; Knowledge Discovery 
Product Name: Alexa 1.0; Concept Explorer Pro 1.0.6 
Languages : English 
Document Type: Software Review 
Grade (of Product Reviewed) : B; C 

Hardware/Software Compatibility: IBM PC Compatible; 
95; Microsoft Windows NT 

Geographic Location: United States 

Presents reviews of two Internet search utilities for IBM PC c 
with Windows 95 or NT. Includes a favorable beta preview of 
(free) from Alexa Internet (415) and a mixed review of Concept Ex 
1.0.6 ($49) from Knowledge Discovery Systems (650). Explains 
is a browser add-on that does not actually search, but works by p 
starting Web site, after which it gives four affinity site 



Internet search utilities 

2 Page(s) 
Systems 

Microsoft Windows 



ompatibles 
Alexa 1 . 0 
plorer Pro 
that Alexa 
roviding a 
s from its 



database. States that Alexa* s interface is pleasing and unobtrusive, its 
suggestions are on-target, and it does not impede browsing performance. 
Reports that Concept Explorer Pro is a * v learning 11 search tool that 
attempts to formulate an effective search query based on its analysis of 
word combinations in documents that you specify as relevant. However, 
complains that it cannot build a query from scratch and has too many flaws 
and too few conveniences. Includes two screen displays and one products 
summary, (jo) 

Descriptors: Search Engines; Internet; Web Tools; Web Browsers; 
Utility Program; Knowledge-based Expert Systems; Online Searching 
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Author: Shahabi, Cyrus; Zarkesh, Amir M. ; Adibi, Jafar; Shah, Vishal 
Corporate Source: Univ of Southern California, Los Angeles, CA, USA 
Conference Title: Proceedings of the 1997 7th International Workshop on 

Research Issues in Data Engineering, RIDE 1 97 

Conference Location: Birmingham, UK Conference Date: 19970407-19970408 
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E.I. Conference No.: 4 6260 

Source: Proceedings of the IEEE International Workshop on Research Issues 
in Data Engineering 1997. IEEE, Los Alamitos, CA, USA,PR07849. p 20-29 
Publication Year: 1997 
CODEN: 850SAJ 
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Abstract: We propose to detect users navigation paths to the advantage of 
web-site owners. First, we explain the design and implementation of a 
p rofile r which captures client 1 s selected links and pages order, accurate 
page viewing time "and "cache references, usingTa Java based remote agent. 
The information captured by the profiler is then utilized by a knowledge 
discovery technique to cluster users with similar interests.. We 
introduce a novel path clustering method based on the similarity of the 
h istory of user navigatio n. This approach is capable of capturing the 
interests of the user which could persist through several subsequent 
hypertext link selections. Finally, we evaluate our path clustering 
technique via a simulation study on a sample WWW-site. We show that 
depending on the level of inserted noise, we can recover the correct 
clusters by 10%-27% of average error margin. (Author abstract) 25 Ref s . 
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Autonomy of operations combined with decentralised management of data 
has given rise to a number of heterogeneous databases or information 
systems within an enterprise. These systems are often incompatible in 
structure as well as content and hence difficult to integrate. This thesis 
investigates the problem of heterogeneous database integration, in order to 



meet the increasing demand for obtaining meaningful information from 
multiple databases without disturbing local autonomy. In spite of 
heterogeneity, the unity of overall purpose within a common application 
domain, nevertheless, provides a degree of semantic similarity which 
manifests itself in the form of similar data structures and common usage 
patterns of existing information systems. This work introduces a conceptual 
integration approach that exploits the similarity in meta level 
information in existing systems and performs metadata mining on database 
objects to discover a set of concepts common to heterogeneous databases 
within the same application domain. The conceptual integration approach 
proposed here utilises the background knowledge available in database 
structures and usage patterns and generates a set of concepts that serve as 
a domain abstraction and provide a conceptual layer above existing legacy 
systems. This conceptual layer is further utilised by an information 
re-engineering framework that customises and packages information to 
reflect the unique needs of different user groups within the application 
domain. The architecture of the information re-engineering framework is 
based on an object-oriented model that represents the discovered concepts 
as customised application objects for each distinct user group . 
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Conference Title: Proceedings of First Pacific-Asia Conference. Knowledge 
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Abstract: Efficient partitioning of large data sets into homogenous 
clusters is a fundamental problem in data mining . The standard 

hierarchical clustering methods provide no solution for this problem, due 
to their computational inefficiency. The k-means based methods are 
promising for their efficiency in processing large data sets. However, 
their use is often limited to numeric data. In this paper, we present a 
k-prototypes algorithm which is based on the k-means paradigm but removes 
the numeric data limitation whilst preserving its efficiency. In the 
algorithm, objects are clustered against k prototypes . A method is 
developed to dynamically update the k prototypes in order to maximise the 
intra- cluster similarity of objects . When applied to numeric data, 

the algorithm is identical to the k-means method. To assist in the 
interpretation of clusters, we use decision tree induction algorithms to 
create rules for clusters. These rules, together with other statistics 
about clusters, can assist data miners to understand and identify 

interesting clusters. (16 Refs) 
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Publication Date: 1997 Country of Publication: Germany ix+396 pp. 
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Conference Title: Principles of Data Mining and Knowledge Discovery. 
First European Symposium, PKDD '97. Proceedings 

Conference Sponsor: Dept. Comput . Inf. Sci . ; Norwegian Res. Council; 
Norwegian Artificial Intelligence Soc 

Conference Date: 24-27 June 1997 Conference Location: Trondheim, 
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Language: English Document Type: Conference Paper (PA) 
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Abstract: Automatic clustering methods are part of data mining 
methods. They aim at building clusters of items so that similar items 
fall into the same cluster while dissimilar ones fall into separate 
clusters. A particular class of clustering methods are hierarchical ones 
where recursive clusters are formed to grow a tree representing an 
approximation of similarities between items. We propose an interactive 
interface to help the user to interpret the result of such a clustering 
process, according to the item characteristics. The prototype has been 
applied successfully to a special case of items providing nice graphical 
representations (electric load curves) but can also be used with other 
types of curves or with more standard items. (10 Refs) 
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Publisher: Japan-Singapore AI Centre, Singapore 
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Conference Sponsor: Nat. Comput . Board 

Conference Date: 14-17 Nov. 1994 Conference Location: Singapore 
Availability: Japan-Singapore AI Centre, 75 Science Park Drive, CINTECH 
II, Singapore 

Language: English Document Type: Conference Paper (PA) 
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Abstract: Most heterogeneous clinical information systems share a strong 
semantic resemblance in spite of their autonomy and differences in data 
requirements and design. This semantic resemblance can be exploited when 
performing schema integration. We propose a methodology to identify a set 
of generic concepts based on their semantic similarity using qualitative 
parameters such as entities' usage patterns, database structures and users' 
domain knowledge. This is different from the traditional data mining 
methods which have to use the data values of the entities. The generic 
concepts can be seen as a customized schema which is geared to address 
different interpretations of the data by different groups of users in a 
clinical environment. (15 Refs) 

Subfile: C 

Descriptors: deductive databases; distributed databases; knowledge 
acquisition; medical information systems 

Identifiers: generic concept discovery; heterogeneous clinical 
information systems; semantic resemblance; data requirements;" schema 
integration; semantic similarity ; qualitative parameters; entity usage 
patterns; database structures; domain knowledge; data mining methods; 
customized schema; clinical environment 

Class Codes: C7140 (Medical administration); C6160B (Distributed 
databases) ; C6170K (Knowledge engineering techniques) ; C6160K (Deductive 
databases ) 

Copyright 1996, IEE 



43/5/6 (Item 4 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2003 Institution of Electrical Engineers. All rts. reserv. 

5162077 INSPEC Abstract Number: C9602-6120-04 0 
Title: Near neighbor search in large metric spaces 

Author (s) : Brin, S . 

Author Affiliation: Dept. of Comput. Sci . , Stanford Univ., CA, USA 
Conference Title: VLDB '95. Proceedings of the 21st International 
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Abstract: Given user data, one often wants to find approximate matches in 
a large database. A good example of such a task is finding images similar 
to a given image in a large collection of images . We focus on the 
important and technically difficult case where each data element is 
high-dimensional, or more generally, is represented by a point in a large 
metric space-and distance calculations are computationally expensive. In 
this paper, we introduce a data structure to solve this problem called a 
GNAT (Geometric Near-neighbor Access Tree) . It is based on the philosophy 
that the data structure should act as a hierarchical geometrical model of 
the data as opposed to a simple decomposition of the data that does not use 
its intrinsic geometry. In experiments, we find that GNATs outperform 
previous data structures in a number of applications. (9 Refs) 
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Conference Sponsor: AAAI; AT&T Global Inf. Solutions; GTE Lab.; et al 
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Abstract: Databases often inaccurately identify entities of interest. Two 
operations, consolidation and link formation, which complement the usual 
machine learning techniques that use similarity based clustering to 
discover classifications, are proposed as essential components of KDD 
systems for certain applications. Consolidation relates i dentifiers pr esent 
in a database to a set of real world entities (RWlTsl which are ~not 
uniquely identified in the database. Consolidation may also be viewed as a 
transformation of representation from the identifiers present in the 
original database to the RWEs. Link formation constructs structured 
relationships between consolidated RWE's through identifiers and events 
explicitly represented in the database. Consolidation and link formation 
are easily implemented as index creation in relational database management 
systems. An operational knowledge discovery system identifies potential 
money laundering in a database of large cash transactions using 
consolidation and link formation. (12 Refs) 
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Journal: Lecture notes in computer science, 1997, 1263 265-276 

ISBN: 3-540-63223-9 ISSN: 0302-9743 Availability: INIST-16343; 
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No. of Refs. : 17 ref . 

Document Type: P (Serial); C (Conference Proceedings) ; A (Analytic) 
Country of Publication: Germany; United States 
Language: English 

We consider several basic classes of tolerance relations among objects. 
These (global) relations are defined from some predefined similarity 
measures on values of attributes. A tolerance relation in a given class of 
tolerance relations is optimal with respect to a given decision table A if 
it contains only pairs of objects with the same decision and the number of 
such pairs contained in the relation is maximal among all relations from 
the class. We present a method for (sub-) optimal tolerance relation 
learning from data (decision table) . The presented method is based on rough 
set approach. We show that for some basic families of tolerance relations 
this problem can be transformed to a relative geometrical problem in a real 
affine space. Hence geometrical computations are becoming useful tools for 
solving the problem of global tolerance relation construction. The 
complexity of considered problems can be evaluated by the complexity of the 
corresponding geometrical problems . We propose some efficient heuristics 
searching for an approximation of optimal tolerance relations in considered 
families of tolerance relations. The global tolerance relations can be 
treated as patterns in the cartesian product of the object set. We show how 
to apply the relational patterns (global tolerance relations) in 
clustering and classification of objects . 

English Descriptors: Artificial intelligence; Learning algorithm; 
Relational database 

French Descriptors: Intelligence artif icielle; Algorithme apprentissage; 
Base donnee relationnelle 

Classification Codes: 001D02C02 
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IDENTIFYING NO. : 9988876 AGENCY CODE: NSF 

CNPq: IMiMD- Indexing and Data Mining in Multimedia Databases 

PRINCIPAL INVESTIGATOR: Faloutsos, Christos 

PERFORMING ORG.: Carnegie-Mellon University, Computer Science, Pittsburgh 
, PA 15213-3815 

PROJECT MONITOR: Zemankova, Maria 

SPONSORING ORG.: National Science Foundation, IIS, 4201 Wilson Boulevard 
, Arlington, Virginia 22230 

DATES: 20000915 TO 20010831 FY : 2000 FUNDS: $300,000 (300000) 
SUMMARY: This is a joint effort with Prof. Caetano Traina from the 
University of Sao Paulo, Brazil. It strengthens the existing collaboration 
between Prof. Christos Faloutsos at CMU and Prof. Traina and his group, 
which has already contributed fast indexing methods for metric and video 
datasets. CMU brings expertise in video indexing (the Informedia DL-II 
project), in power laws, and in data mining . The benefit of the 

collaboration will be faster methods for indexing multimedia and metric 
datasets, and for finding patterns in such collections . This project 
focuses on indexing multimedia data and on developing new tools to find 



f 

' patterns and correlations in such data. Multimedia objects can often be 
^ mapped to n-dimensional points through feature extraction. If not, then 
they can be treated as metric data, when we are provided a pair-wise 
distance function. The methods will be applicable to multimedia, metric and 
spatial data alike. Typical questions include: "find video clips similar to 
a given video clip"; "how strong is the correlation (or anti-correlation) 
between the locations of schools and the locations of libraries?"; "how 
many schools are within 5 miles from libraries?". For indexing, the goals 
are (a) to provide formulas to estimate the selectivities for similarity 
queries and (b) to build faster searching structures. Preliminary joint 
work showed that the distribution of distances in spatial and metric 
datasets often follows a "power-law", which are useful to design better 
search strategies. For data mining , the goals are to provide tools for 
detection of spatial correlations and to develop fast visualization 
algorithms for spatial and multimedia datasets. The developed tools will be 
able to show whether there are clusters in a dataset, how many they are, 
and whether two groups of points (e.g. "schools" and "libraries") are 
"attracting" or "repelling" each other. 
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03522919 E.I. Monthly No: EI9212149288 

Title: Symbolic clustering using a new similarity measure . 

Author: Gowda, K. Chidananda; Diday, E. 

Corporate Source: S J Coll of Eng, Karnataka, India 

Source: IEEE Transactions on Systems, Man and Cybernetics v 22 n 2 
Mar-Apr 1992 p 368-378 
Publication Year: 1992 
CODEN: ISYMAW ISSN: 0018-9472 
Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 9212 

Abstract: A hierarchical, agglomerative, symbolic clustering methodology 
based on a similarity measure that takes into consideration the 
position, span, and content of symbolic objects is proposed. The 
similarity measure used is of a new type in the sense that it is not 
just another aspect of dissimilarity. The clustering methodology forms 
composite symbolic objects using a Cartesian join operator when two 
symbolic objects are merged. The maximum and minimum similarity values 
at various merging levels permit the determination of the number of 
clusters in the data set . The composite symbolic objects representing 
different clusters give a description of the resulting classes and lead 
to knowledge acquisition. The algorithm is capable of discerning clusters 
in data sets made up of numeric as well as symbolic objects consisting of 
different types and combinations of qualitative and quantitative feature 
values. In particular, the algorithm is applied to fat-oil and 
microcomputer data. 25 Refs. 

Descriptors: *DATA PROCESSING--*Data Structures; COMPUTER PROGRAMMING — 
Algorithms; GRAPHIC METHODS; MATHEMATICAL TECHNIQUES --Numerical Analysis; 
COMPUTERS, MICROCOMPUTER; OILS AND FATS 

Identifiers: SYMBOLIC CLUSTERING; SIMILARITY MEASURES 

Classification Codes: 

723 (Computer Software); 921 (Applied Mathematics); 822 (Food 
Technology); 804 (Chemical Products) 

72 (COMPUTERS & DATA PROCESSING); 92 (ENGINEERING MATHEMATICS); 82 
(AGRICULTURE & FOOD TECHNOLOGY) ; 80 (CHEMICAL ENGINEERING) 
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Title: Knapsack formulation of image matching. 

Author: Daskalakis, T. N.; Daskalakis, C. N. 

Conference Title: Proceedings of the 1991 International Conference on 
Acoustics, Speech, and Signal Processing - ICASSP 91 

Conference Location: Toronto, Ont, Can Conference Date: 19910514 
Sponsor: IEEE Signal Processing Soc 
E.I. Conference No.: 16306 

Source: Proceedings - ICASSP, IEEE International Conference on Acoustics, 
Speech and Signal Processing v 4. Publ by IEEE, IEEE Service Center, 
Piscataway, NJ, USA (IEEE cat n 91CH2977-7). p 2365-2368 

Publication Year: 1991 

CODEN: IPRODJ ISSN: 0736-7791 ISBN: 0-7803-003-3 
Language: English 

Document Type: PA; (Conference Paper) Treatment: T; (Theoretical); A; 
(Applications ) 

Journal Announcement: 9206 

Abstract : A technique is described to solve one of the problems 
encountered in an image analysis system, namely, region matching. The 
approach adopted is that of a medium level process. The information 
gathered during a segmentation stage is used to transform the original 
images into a collection of two- valued ones and a collection of 
gray-level histograms. Mathematical morphology is then used to define a 



measure of similarity on which the process is based. Correspondences are 
* established between the segments according to a number of principles which 
are presented. A well behaved and efficient algorithm is proposed for the 
solution of the problem. 17 Refs. 

Descriptors: * IMAGE PROCESSING--*Image Analysis; MATHEMATICAL TECHNIQUES 
--Numerical Methods 

Identifiers: SHAPE ANALYSIS; MOTION ANALYSIS; IMAGE MATCHING 
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723 (Computer Software); 741 (Optics & Optical Devices); 921 (Applied 
Mathematics) 
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(ENGINEERING MATHEMATICS) 
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Title: Heuristics to locate the best document set in information 
retrieval systems. 

Author: Lucarella, D. 

Corporate Source: Univ -degli Studi di Milano, Milan, Italy 

Conference Title: Eighth Annual International Phoenix Conference on 

Computers and Communications - 1989 Conference Proceedings 

Conference Location: Scottsdale, AZ, USA Conference Date: 19890322 
Sponsor: IEEE, Communications Soc, New York, NY, USA; IEEE, Computer Soc, 

Los Alamitos, CA, USA; Arizona State Univ, Tempe, AZ, USA 
E.I. Conference No.: 12466 

Source: Eighth Annu Int Phoenix Conf Comput Commun 1989 Conf Proc . Publ 
by IEEE, IEEE Service Center, Piscataway, NJ, USA. Available from IEEE 
Service Cent (cat n 89CH2713-6) , Piscataway, NJ, USA. p 567-571 

Publication Year: 1989 

Language: English 

Document Type: PA; (Conference Paper) Treatment: X; (Experimental) 
Journal Announcement: 9001 

Abstract: The use of best-match search strategies in information 
retrieval systems is discussed. In response to a given query, best-match 
searching requires the identification of those documents in the 
collection which are most similar to the query, with similarity being 
measured by an appropriate closeness function. The emphasis is on 
heuristics to efficiently locate the closest documents set . The problem 
is introduced with reference to a straightforward search procedure that 
returns the best documents manipulating inverted index entries. An improved 
algorithm is presented which computes in advance an upper bound on 
closeness, avoiding the exact computation of closeness in many instances 
and thus optimizing both the number of documents to be evaluated and the 
number of inverted lists to be inspected. The algorithm is analyzed, and 
experimental results are reported. 17 refs. 

Descriptors: * I N FORMAT I ON RETRIEVAL SYSTEMS; COMPUTER PROGRAMMING — 
Algorithms; SYSTEMS SCIENCE AND CYBERNETICS — Heuristic Programming 
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Publication Year: 1984 

CODEN: PTNRA8 ISSN: 0031-3203 

Language: ENGLISH 

Journal Announcement: 8411 

Abstract: A relational model for describing three-dimensional objects 
has been designed and implemented. An important use of the model is to 
characterize the similarity and differences between three-dimensional 
objects. Toward this end, was defined a measure of relational similarity 

between three-dimensional object model and a measure of feature 
similarity , based only on Euclidean distance between attribute-value 
tables. A series of computer test compares the results of using the two 
different similarity measures and concludes that the relational 
similarity is much more powerful than the feature similarity and should be 
used when grouping the objects in the database for fast access. 50 
ref s . 
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As the number of servers grows rapidly, it becomes difficult to search 
information in the Internet. To broadcast requests to all servers will 
overwhelm the underlying networks. Moreover, most requests are sent to 
irrelevant servers. 

To determine relevant servers for user queries, we propose the 
client-directory-server model. In this model, a user sends a query to the 
"directory", which ranks servers based on their relevance to the query. 
Users are allowed to search information containing exact keywords or 
embedded concepts. 

To search information by exact keywords, we propose a new Boolean 
similarity measure to rank servers with respect to Boolean queries . In 
contrast with other known method, our method reduces time and space 
complexity from exponential to polynomial in the number of Boolean terms. 
To search information by conceptual meanings, we integrate latent semantic 
indexing and hierarchic agglomerative clustering methods. We cluster 
objects based on their conceptual meanings and arrange them in a 
hierarchic structure to reduce searching time. In addition, we develop a 
new visualization scheme which displays the relationships between query 
terms and documents in a two-dimensional space. 

In this research, we describe our proposed methods and a prototype 
user interface Vintage. We conduct experiments on the USC Homer database 
and four standard document collections, CACM, CISI, CRAN, and MED, for 
which queries and relevant judgments are available. We compare our 
performance with existing methods and obtain better results in precision, 
recall, and space and time complexity. 
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This dissertation examines the use of adaptive methods to 
automatically improve the performance of ranked text retrieval systems. The 
goal of a ranked retrieval system is to manage a large collection of text 

documents and to order documents for a user based on the estimated 
relevance of the documents to the user's information need (or query). 
The ordering enables the user to quickly find documents of interest. Ranked 
retrieval is a difficult problem because of the ambiguity of natural 
language, the large size of the collections, and because of the varying 
needs of users and varying collection characteristics. 

We propose and empirically validate general adaptive methods which 
improve the ability of a large class of retrieval systems to rank documents 
effectively. Our main adaptive method is to numerically optimize free 
parameters in a retrieval system by minimizing a non-metric criterion 
function. The criterion measures how well the system is ranking documents 
relative to a target ordering, defined by a set of training queries which 
include the users 1 desired document orderings . Thus, the system learns 
parameter settings which better enable it to rank relevant documents 
before irrelevant. The non-metric approach is interesting because it is a 
general adaptive method, an alternative to supervised methods for training 
neural networks in domains in which rank order or prioritization is 
important. A second adaptive method is also examined, which is applicable 
to a restricted class of retrieval systems but which permits an analytic 
solution . 

The adaptive methods are applied to a number of problems in text 
retrieval to validate their utility and practical efficiency. The 
applications include: A dimensionality reduction of vector-based document 
representations to a vector space in which inter-document similarity more 
accurately predicts semantic association; the estimation of a similarity 
measure which better predicts the relevance of documents to queries; 
and the estimation of a high-performance neural network combination of 
multiple retrieval systems into a single overall system. The applications 
demonstrate that the approaches improve performance and adapt to varying 
retrieval environments. We also compare the methods to numerous alternative 
adaptive methods in the text retrieval literature, with very positive 
results . 



25/5/9 (Item 5 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 

1012178 ORDER NO: AAD88-08196 

COMMUNICATION AND ORGANIZATIONAL IMAGE: A STUDY OF THE SHARED KNOWLEDGE OF 
AN ORGANIZATION 

Author: TREADWELL, DONALD FRANCIS 
Degree: PH. D 
Year: 1987 

Corporate Source/Institution: RENSSELAER POLYTECHNIC INSTITUTE (0185) 
Source: VOLUME 49/04-A OF DISSERTATION ABSTRACTS INTERNATIONAL. 

PAGE 659. 217 PAGES 
Descriptors: SPEECH COMMUNICATION; PSYCHOLOGY, SOCIAL 
Descriptor Codes: 0459; 0451 

An empirical study of the images of an organization held by its 



members and publics was undertaken to assess the relationships between 
image, communication, and commitment to the organization. Image, or 
subjective knowledge, of an organization is thought to develop out of 
communication between members, and to account for their commitment to the 
organization. Similarly, members are expected to be more committed to their 
organization if their images of it are both positive and similar. Such 
notions, although implicit in organizational literature, have received 
little empirical attention; neither have non-members' images of an 
organization been assessed in this context. 

To investigate the relationships between image, communication 
activity, and commitment, data were collected from members of a four-year 
college. To explore non-member images of the college as a function of 
communication with it, data were also obtained from two external 
groups--guidance counselors and high school seniors. 

Statements describing the college were obtained from interviews with 
both campus members and non-members. Image was operationalized as 
individuals' rank-ordering of these statements, using a Q-sort procedure. 
Nine major image dimensions or themes were obtained by clustering 
statements on the basis of rank order scores assigned to each statement by 
respondents. Two types of dimension were identified — idealistic/abstract 
and pragmatic/decisional . An image-similarity score, which indexes the 
extent to which an individual's ranking of statements was similar to those 
of other members of the college, was also computed for each respondent. 

For students , a significant positive correlation was found between 
image similarity and self-reported measures of communication. A 
positive correlation between image similarity and commitment was also found 
for students , and for some employee groups . Highly committed members 
saw the college in idealistic, goal-oriented terms, whereas less committed 
members saw the college in pragmatic or problematic terms. For student 
groups particularly, the analyses indicate positive relationships between 
communication, commitment, and perceiving the organization in idealistic 
terms . 

Non-members with images closest to those of members use communication 
contacts that appear to provide relatively limited information, suggesting 
that access to alternate information about an organization limits the 
extent to which non-members achieve similarity of image with its members. 
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The experiments involved 11,571 abstracts (with titles), 1,000 keyword 
stems and 93 search requests. Measures of word association are derived in 
several ways from the numbers of documents in which two given words 
co-occur, and measures of similarity from the numbers of words 
associated with both. Word clusters with different degrees of overlap are 
derived from the resulting networks of word connections for use as document 
descriptors. All are employed in retrieval and their performance analyzed. 
Two new measures, sensitivity and coverage, reflect the variation in a 
strategy's performance from request to request. The best strategy depends 
on the user's requirements. For a single strategy, key-words are simplest 
but the quantities of output are erratic and may usefully be controlled 
according to word associations. If two strategies can be used, key-words 
alone may be followed by associations, yielding in a similar output 
quantity 30% more relevant documents . The corresponding use of 



clusters is marginally better but unlikely to justify its extra cost. 
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Conference Title: Image Analysis and Processing. 9th International 
Conference, ICIAP 1 97 Proceedings Part vol.1 p. 311-18 vol.1 
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Publisher: Springer-Verlag, Berlin, Germany 

Publication Date: 1997 Country of Publication: Germany 2 vol. 

(xxii+722+794) pp. 
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Conference Title: Proceedings of ICIAP 97. 9th International Conference 
on Image Analysis and Processing 

Conference Sponsor: IAPR 

Conference Date: 17-19 Sept. 1997 Conference Location: Florence, Italy 
Language: English Document Type: Conference Paper (PA) 
Treatment: Theoretical (T) 

Abstract: This paper presents an unsupervised texture segmentation 
method, which uses distributions of local binary patterns and pattern 
contrasts for measuring the similarity of adjacent image regions during 
the segmentation process. The nonparametric log-likelihood test, the G 
statistic, is engaged as a pseudo-metric for comparing feature 
distributions. A region-based algorithm is developed for coarse image 
segmentation and a pixelwise classification scheme for improving 
localization of region boundaries. The performance of the method is 
evaluated with various types of test images . The same set of parameter 
values is used in all the experiments with texture mosaics in order to 
demonstrate the robustness of our approach. (18 Refs) 
Subfile: B C 
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Language: English Document Type: Journal Paper (JP) 
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Abstract: A method is proposed whereby objects in gray-scale images can 
be classified automatically according to their arc-approximated contour 
representations. Their similarity measure is determined by the one's 
complement of the absolute difference between the autocorrelation 
coefficients of sections of the approximated boundaries of the template and 
test objects . Assignment to a particular cluster is done using the 
threshold values determined by the probability integral. (23 Refs) 

Subfile: B C 

Descriptors: approximation theory; computer vision; correlation methods; 
edge detection; image matching; image representation; image segmentation; 
object recognition 

Identifiers: automatic pattern classification; gray-scale-image objects; 
autocorrelation; similarity measure ; arc-approximated contours; image 
representations; approximated boundaries; cluster; probability integral 

Class Codes: B6140C (Optical information, image and video signal 
processing); B0290F (Interpolation and function approximation); C5260B ( 
Computer vision and image processing techniques); C1250 (Pattern 
recognition); C4130 (Interpolation and function approximation) 

Copyright 1996, IEE 



25/5/18 (Item 5 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2003 Institution of Electrical Engineers. All rts. reserv. 

4880434 INSPEC Abstract Number: C9503-6170K-100 
Title: A context similarity measure 
Author(s): Biberman, Y. 

Author Affiliation: Dept. of Math. & Comput . Sci., Ben-Gurion Univ. of 
the Negev, Beer-Sheva, Israel 
p. 49-63 

Editor(s): Bergadano, F. ; De Raedt, L. 
Publisher: Springer-Verlag, Berlin, Germany 

Publication Date: 1994 Country of Publication: West Germany xi+4 38 
pp. 

ISBN: 3 540 57868 4 

Conference Title: Machine Learning: ECML-94 . European Conference on 
Machine Learning 

Conference Date: 6-8 April 1994 Conference Location: Catania, Italy 
Language: English Document Type: Conference Paper (PA) 
Treatment: Practical (P) ; Theoretical (T) 

Abstract: This paper concentrates upon similarity between objects 
described by vectors of nominal features. It proposes non-metric measures 

for evaluating the similarity between: two identical values in a 
feature; two different values in a feature; and two objects . The paper 
suggests that similarity is dependent upon the context: it is influenced 

by the given set of objects , and the concept under discussion. The 
proposed context- similarity measure was tested, and the paper presents 
comparisons with other measures. The comparisons suggest that compared to 
other measures , the context- similarity suits best for natural concepts. 
This paper concentrates upon similarity in the context of learning. 
Exemplar based learning models suggest that concepts are learned by 
memorizing examples; the main information the learner needs to store in 
his/its memory is the classified examples the teachers supply; no general 
information in the form of rules is induced; the learner classifies new 
examples by comparing them to stored exemplars. (12 Refs) 
Subfile: C 
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Abstract: The paper describes the retrieval process from image databases, 
based on a partial match between the query and the images. The proposed 
approach allows one to measure the similarity between the query and the 
images in the database and to retrieve those having the highest probability 
to be relevant. The paper describes the query processing and the access 
structures, based on the "signature method". Four levels of signature files 
are associated to the image database and a. signature is associated to the 
query. The query signature is compared with the image signatures in a four 
step image processing algorithm. The result of the process is a set of 

images with an associated recognition degree, measured by using 

information provided by the user during query formulation (such as 
importance of the presence of each object) and by using the image structure 
and the recognition degree associated to each object. The retrieved images 

are presented to the user in decreasing relevance order. The method 
described so far is inefficient, since the selection of most relevant 

images is executed among all relevant images (even those having a low 

relevance ) . The paper presents two approaches for improving the 
efficiency of query processing: by reducing the number of accesses to the 
image database; and by reducing the number of accesses to the signature 
file. The advantages and drawbacks of each method are illustrated. (20 

Refs) 
Subfile: B C 

Descriptors: image recognition; probability; query processing; visual 
databases 
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Journal: Proceedings of the SPIE - The International Society for Optical 
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Abstract: This paper discusses a retrieval scheme for an information 
retrieval system in which the feedback from a number of users of the system 
about its performance (global feedback) is stored in the form of clusters 
called user-oriented clusters. The clusters are described by using the 
description of its constituent documents . The clusters and queries are 
represented as vectors and the measure of similarity between them is 
represented as the cosine of the angle between the two. The clusters are 
retrieved as per decreasing order of similarity with respect to a query. An 
important problem that arises in the context of cluster description is the 
significance of an index term assigned to documents . This problem, 
called term refinement problem, is formulated and solved. The experimental 
results of the proposed retrieval scheme are compared with those of the 
vector space model and the results obtained are encouraging. (18 Refs) 
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Abstract: A content-based indexing . technique is proposed. The image 
features used are derived from the relative spatial relationships among 
internal image entities. The similarity measurement is based on 

causality (probability) which indicates the degree of similarity between a 
user's query and images . The index structure contains a set of 
causality-based similarity trees with nodes connected to an information 
causal net. For a given ( weighted ) query, the initial set of similar 
images is identified via similarity trees and then refined through the 
information causal net. The method is introduced with an example using 
magnetic resonance chest images. (9 Refs) 
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magnetic resonance chest images 

Class Codes: C7330 (Biology and medicine); C6160S (Spatial and pictorial 
databases); C7240 (Information analysis and indexing) 



25/5/22 (Item 9 from file: 2) 

DIALOG (R) File 2 : INSPEC 

(c) 2003 Institution of Electrical Engineers. All rts. reserv. 

02816181 INSPEC Abstract Number: A87028671 

Title: Use of image similarity for the selection or synthesis of 
projections for subtraction radiography 

Author (s): Ruttimann, U.E.; van der Stelt, P.F.; Webber, R.L. 
Author Affiliation: Nat. Inst. of Dental Res., Nat. Inst, of Health, 
Bethesda, MD, USA 

Journal: Proceedings of the SPIE - The International Society for Optical 
Engineering vol.626, pt . 1 p. 301-7 

Publication Date: 1986 Country of Publication: USA 
CO DEN : PSISDG ISSN: 0277-786X 

Conference Title: Application of Optical Instrumentation in Medicine XIV 
and Picture Archiving and Communication Systems (PACS IV) for Medical 
Applications 

Conference Sponsor: SPIE 

Conference Date: 2-7 Feb. 1986 Conference Location: Newport Beach, CA, 
USA 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Theoretical (T) 

Abstract: The use of subtraction radiography in dentistry is impeded by 
the necessity to couple physically the X-ray source, the patient and the 
film, in order to achieve a reproducible projection geometry. This need can 
be obviated by the ability to synthesize arbitrary projection images from 
a basis set of projections bearing a known geometric relationship to 
each other. Implementation of this method requires knowledge of the 
projection angle of the desired projection image relative to the basis set. 
This investigation explores the feasibility of using the gray-level 
standard deviations in corresponding subtraction images as similarity 

measures , in order to determine retrospectively the projection angle of a 
radiograph of interest with respect to the set of basis projections. An 

iterative coordinate estimation procedure is developed incorporating this 
technique, and its accuracy is evaluated using radiographs obtained from 
dry skull specimens. (11 Refs) 
Subfile: A 
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ABSTRACT: In order to make a decision or to solve a problem among more than 
two people , it is very important to express their ideas and 
concepts and to discuss their differences. A cognitive map is useful as 
a toll concept representation. Based on individual cognitive maps, they 
can discuss their differences and form their common concensus. For this 
purpose, this paper presents a similarity measure between two 
cognitive maps and difference index between a pair of elements. First, 
this paper formulates the cognitive map in the form of a diagraph and 
introduces a difference measure on the cognitive map. Next, this paper 
shows that the relationship between similarity measure defined by 
the qualitative distance measure can be represented by the difference 
measures among every pairs of elements, (author abst . ) 

DESCRIPTORS: learning; directed graph; education and training; flow graph; 
thinking; matching ( graph) ; logic; flow chart; similarity; 
formulation (mathematics ) 

BROADER DESCRIPTORS: graph; matching; drawing (diagram) ; diagram and table; 
property 

CLASSIFICATION CODE(S): AA04000C 
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ABSTRACT: In order to make a decision or to solve a problem among more than 
two people , it is very important to express their idears and 
concepts and to discuss their differnces. A cognitive map is useful as 
a tool of concept representation. Based on individual cognitive maps, 
they can discuss their differnces and form their common concensus. For 
this purpose, this paper presents a similarity measure between two 
cognitive maps. First, this paper formulates the cognitive map in the 
form of a digraph and introduces an importance measure and a 
qualitative distance measure for a pair of elements on the cognitive 
map. Next, this paper shows that the relationship between similarity 
measure defiend by the qualitative distance measure can be represented 
by the importance measures among every pairs of elements . (author 
abst . ) 

DESCRIPTORS: group activity; problem solving; decision making; directed 
graph; cognitive science; similarity; graph theory; education and 
training; recognition; individual specificity 

BROADER DESCRIPTORS: decision; graph; science; property; mathematics; 
theory; biological comparison; comparison 
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JOURNAL NUMBER: G0337ABS ISSN NO: 0373-4447 

UNIVERSAL DECIMAL CLASSIFICATION: 002.5:005 
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DOCUMENT TYPE: Journal 
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ABSTRACT: In most information retrieval systems or models, the assumption 
is normally made that index terms assigned to the documents of a 
collection occur independently of each other. So as to improve the 
retrieval effectiveness of systems, there is a need to take 
dependencies between certain index term pairs into account. As the 
similarity measure between a query and a document is important in 
quantitative retrieval, two measures, which reflect directly the 
relationships between index terms when they are given by pairwise 
correlations, and proposed in this paper. One of the proposed measures 
is an extension of the cosine function model. This measure is based on 
oblique coordinates whose degree of angle between axes corresponds to 
the pairwise correlation between index terms, in contrast to the 
conventional cosine function measure based on rectangular coordinates. 
The other measure is an extension of the extended Boolean model, which 
was proposed by G. Salton et al . Using these measures, we need no 
assumption of term independence. Retrieval experiments to evaluate the 
proposed measures was performed on a test collection of 623 document 
records and 5 queries, in a weighted mode, in which index terms 
assigned to the document record were weighted, and in an unweighted 
mode. The experiment showed following wesults: 1) it is useful to 
incorporate term dependencies into the similarity measures ; and 2) 
the proposed measures, however, did not have much better effectiveness 
than conventional ones, (author abst.) 
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Abstract: This paper examines the problem of estimating how similar N 
objects are when they are compared with each other. The proposed approach 
uses as data comparative judgements of all possible pairs of the N objects. 
Pairwise comparisons have long been used with success in determining the 
relative importance of individual members in a group of objects . 
In the proposed approach the pairwise comparisons focus on the similarity 
relations instead of the relative importance of each object. A quadratic 
programming model is also proposed. This model processes the 
similarity-based pairwise comparisons and determines the similarity 
relations among the N objects . The proposed quadratic programming model 
has linear constraints; therefore it can be solved easily by transferring 
it into a system of linear equations. (Author abstract) 24 Refs. 

Descriptors: *Data reduction; Linear programming; Mathematical models; 
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Learning involves either the incorporation of new facts into prior 
knowledge or the modification of the old knowledge structure. Therefore, 
the construction and organization of structure of a domain knowledge should 
help the understanding of the student's learning. If the student's 
cognitive structure in memory can be represented externally, then the 
instructor can better assess the student's learning difficulty, and 
remedial instruction can be more effective. 

The purpose of this study was to investigate the relationship of 
high school students ' mechanics misconceptions, achievement, and their 
cognitive structures. Two instruments were used in this study, a Force 
Concept Inventory (FCI) was administered to test students ' misconceptions 
on mechanics, and a similarity rating task developed by the researcher 
was used to derive the students' proximity data on mechanics concepts. The 
sample included high school students (grade 11) in middle Taiwan. 



Similarity ratings were transformed into a network (cognitive structure) 
by the Pathfinder algorithm. Students were divided into three groups 
according to their performance levels on classroom achievement tests. Three 
high school teachers were also asked to rate the similarity pairs, and the 
median of their ratings represents the content /teachers 1 structure. The 
data shows that there were more perceived connections among concepts in 
content/teachers 1 structure than students', and high achievers 1 cognitive 
structures were more similar to that of content /teachers than low 
achievers. In addition, some similarity pairs were found to have predictive 
power to students 1 correct and incorrect responses to FCI items . However, 
data analyses show no significant difference in cognitive structure 
between students with and without misconceptions. 



37/5/39 (Item 1 from file: 202) 

DIALOG (R) File 202: Info. Sci . & Tech. Abs . 
(c) Information Today, Inc. All rts. reserv. 

3001108 

On the creation of hypertext links in full-text documents. Measurement of 
inter-linker consistency. 

Author(s): Ellis, D; Willett, P 

Corporate Source: Univ. of Sheffield, Sheffield, England 

Journal of Documentation vol. 50, no. 2, pages 67-98 

Publication Date: Jun 1994 

ISSN: 0022-0418 

Language: English 

Document Type: Journal Article 

Record Type: Abstract 

Journal Announcement: 3000 

An important stage in the process of retrieval of objects from a 
hypertext database is the creation of a set of inter-nodal links that are 
intended to represent the relationships existing between objects . This 
operation is often undertaken manually, just as index terms are often 
manually assigned to documents in a conventional retrieval system. It is of 
interest to investigate the consistency of assignment of links in separate 
hypertext versions of the same full-text document, since a measure of 
agreement may be related to the subsequent utility of the resulting 
hypertext databases. In this paper, the authors describe the application of 
arithmetic coefficients and topological indices to the measurement of the 
degree of similarity between the sets of inter-nodal links in hypertext 
databases. The paper presents the results of a study in which several 
different sets of links are inserted, by different people, between the 
paragraphs of each of a number of full-text documents . The results show 
little similarity between the sets of links identified by different 
people . This finding is comparable with those of studies of inter-indexer 
consistency, where it has been found that there is generally only a low 
level of agreement between the sets of index terms assigned to a document 
by different indexers. 

Descriptors: Databases; Documents; Full text systems; Hypertext 
Classification Codes and Description: 5.11 (Searching and Retrieval); 6.02 

(Bibliographic Search Services, Databases) 
Main Heading: Information Processing and Control; Information Systems and 
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Book Title: In Waldron, Helen J., Ed.; Long, F. Raymond, Ed. Proceedings Of 
The American Society For Information Science, Volume 10. 36th Annual 
Meeting, Los Angeles, California, October 21-25, 1973. 1973. Greenwood 
Press, Westport, Connecticut. P. 189-190. 7 Ref. Wo 



Author (s): Preece, Scott E 

Corporate Source: Illinois Institute Of Technology, Chicago. 

Publication Date: 1973 

Language: English 
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The use of clustering as a method of ordering results, and the relative 
merits of this use is reported. Cluster-forming systems are of three types: 
redistributing, graphy-theoretic, hierarchic, each with a "level" based on 
the similarity between the entities merged to form it. Items 
clustered may be terms or documents . Clustering can be used in 
addition to, instead of, or in conjunction with both boolean and ranking 
schemes. Post-retrieval clustering will remove many irrelevant documents 
which could otherwise have formed clusters to hide relevant documents 
, and permits the user to restrict the terms used in the clustering to 
those of the request. Economies may be observed in forming the dissimilaity 
and document-term coincidence matrices. Terms used in clustering may be 
weighted, allowing a user to specify each term's influence on the 
association measure. Clustering is useful for controlled-vocabulary data 
bases resulting in closer association between similar documents . 
Post-retrieval clustering aids in removing ambiguities among terms and in 
eliminating some kinds of false drops, and increases the convenience of the 
results for quick scanning. 
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ABSTRACT: Consistency of indices that quantify subjective similarity over 
different objects is important for translation of colloquial 
modifiers as same as selection of the indices. In this paper, 
consistency of relationship between subjective degree of similarity 
for vague objects and 16 mathematical similarity indices of the 
fuzzy sets of the objects is discussed when the vague objects are 
exchanged. First, the relationship between them is examined in an 
experiment abopted verbal expressions of " weight " as vague objects . 
In analysis of correlation and discrepancy between them, it is clear 
that two indices concerning distance between fuzzy sets correspond to 
the subjective degree of similarity closely. Then, comparing these 
results with the results obtained from the former experiment used 
verbal expression of "height" shows the consistency of the relationship 
between them, (author abst . ) 

DESCRIPTORS: fuzzy set; translation ( language ) ; ambiguity; distance; 
similarity; subjective evaluation; human factor 
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In 4 studies, the authors examined antecedents of self-definition as 
either a unique individual (the individual self) or an interchangeable 
group member (the collective self) . Accentuation of perceived similarities 
versus differences among in- group members including the self served as 
the main indicator of participants ' relative emphasis on their 
individual or collective self. Following prior work in the social identity 
and self-categorization theory tradition, the authors predicted and found 
systematic variations in the relative emphasis on the individual or 
collective self. Relative emphasis varied with the valence of temporarily 
salient in-group features, with the more stable or chronic attractiveness 
of one's in-group, and with awareness of special treatment of the in-group 
by the outside world. Finally, issues are discussed concerning the 
cognitive construal of in-groups as well as the role of the individual self 
and the collective self for strategies of social mobility and social 
change . 
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Abstract: Similarity measures are functions which describe the degree 
of likeness between two objects . We propose a method of using 
domain-specific expert knowledge to infer a functional similarity- 
measure between objects in that domain. Using a user interface and a 
collection of exemplar objects , an expert interactively constructs the 
similarity structure of the domain under consideration. From the expert 
rankings and dissimilarity assignments, a vector representation of the 
exemplar objects is found. A neural network is then trained to find a 
similarity measure , which then can be used for indexing and content 
based retrieval. Using this approach, a system for retrieval of simple 
three-dimensional polyhedra is implemented. (Author abstract) 12 Refs. 

Descriptors: ^Information retrieval; Neural networks; User interfaces; 
Knowledge representation; Heuristic methods; Indexing (of information) ; 
Image processing; Expert systems; Interactive computer systems 

Identifiers: Content based image retrieval 

Classification Codes': - 

723.4.1 (Expert Systems) 

903.3 (Information Retrieval & Use); 723.4 (Artificial Intelligence); 
722.2 (Computer Peripheral Equipment); 903.1 (Information Sources & 
Analysis) 

903 (Information Science); 723 (Computer Software); 722 (Computer 
Hardware); 921 (Applied Mathematics) 

90 (GENERAL ENGINEERING); 72 (COMPUTERS & DATA PROCESSING); 92 
(ENGINEERING MATHEMATICS) 



19/5/4 (Item 1 from file: 35) 

DIALOG (R) File 35 : Dissertation Abs Online 

(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 

01610239 ORDER NO: AAD98-09318 

ACQUISITION OF 3D MODELS FROM A SET OF 2D IMAGES (COMPUTER VISION, 
IMAGE MATCHING, THREE-DIMENSIONAL, TWO-DIMENSIONAL) 

Author: CHENG, YONG-QING 
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The acquisition of accurate 3D models from a set of images is an 
important and difficult problem in computer vision. The general problems 



considered in this thesis are how to compute the camera parameters and 
build 3D models given a set of 2D images . 

The first set of algorithms presented in this thesis deal with the 
problem of camera calibration in which some or all of the camera parameters 
must be determined. A new analytical technique is derived to find relative 
camera poses for three images, given only calibrated 2D image line 
correspondences across three images. Then, a general non-linear algorithm 
is developed to estimate relative camera poses over a set of images . 
Finally, the presented algorithms are extended to simultaneously compute 
the intrinsic camera parameters and relative camera poses from 2D image 
line correspondences over multiple uncalibrated images. 

To reconstruct and refine 3D lines of the models, a multi-image and 
multi-line triangulation method using known correspondences is presented. A 
novel non- iterative line reconstruction algorithm is proposed. Then, a 
robust algorithm is presented to simultaneously estimate a model consisting 
of a set of 3D lines while satisfying object-level constraints such as 
angular, coplanar, and other geometric 3D constraints. 

Finally, to make the proposed approach widely applicable, an 
integrated approach to matching and triangulation from noisy 2D image 
points across two images is first presented by introducing an affinity 
measure between image point features, based on their distance from a 
hypothetical projected 3D pseudo-intersection point. A similar approach to 
matching and triangulation from noisy 2D image line segments across three 
images is proposed by introducing an affinity measure among 2D image 
line segments via a 3D pseudo-intersection line. 
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Author(s): Cheng, Y.-Q.; Wu, V.; Collins, R.T.; Hanson, A.R.; Riseman, 

E.M. 

Author Affiliation: Dept. of Comput . Sci . , Massachusetts Univ., Amherst, 
MA, USA 

Journal: Proceedings of the SPIE - The International Society for Optical 
Engineering Conference Title: Proc. SPIE - Int. Soc. Opt. Eng. (USA) 
vol.2727, pt.l p. 453-62 

Publisher: SPIE-Int. Soc. Opt. Eng, 

Publication Date: 1996 Country of Publication: USA 
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SICI: 027 7-78 6X (1996) 2727: 1L. 4 53:MWBM; 1-L 
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U.S. Copyright Clearance Center Code: 0 8194 2103 0/96/$6.00 
Conference Title: Visual Communications and Image Processing '96 
Conference Sponsor: SPIE; IEEE 

Conference Date: 17-20 March 1996 Conference Location: Orlando, FL, 
USA 

Language: English Document Type: Conference Paper (PA); Journal Paper 
(JP) 

Treatment: Theoretical (T) 

Abstract: An important and difficult problem in computer vision is to 
determine 2D image feature correspondences over a set of images . In 
this paper, two new affinity measures for image points and lines from 
different images are presented, and are used to construct unweighted and 
weighted bipartite graphs. It is shown that the image feature matching 
problem can be reduced to an unweighted matching problem in the bipartite 
graphs. It is further shown that the problem can be formulated as the 
general maximum-weight bipartite matching problem, thus generalising the 
above unweighted bipartite matching technique. (14 Refs) 
Subfile: B C 

Descriptors: computer vision; graph theory; image matching; image 
sequences 

Identifiers: maximum-weight bipartite matching technique; image feature 
matching; computer vision; 2D image feature correspondence; affinity 
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Mining database treasures . (Data Warehousing) (Technology Information) 

Rosen, Cal 

Computing Canada, v22, n20, p42(l) 
Sep 26, 1996 

ISSN: 0319-0161 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 672 LINE COUNT: 00060 

... they can make a customer a lot of money." 

Some common yet powerful data mining applications spanning multiple 
industries include: 

* market basket analysis or product affinity analysis (What 
products or services are most frequently purchased as a group ? What are 
the 'on ad 1 products which drive the highest value affinity sales?). 

* customer retention/vulnerability (What are the factors or 
characteristics which predict that a customer is on the verge of 
cancellation?) . 

* customer acquisition life cycle. . . 
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Online Execs - "Churn" Still A Problem. 

Newsbytes, pNEW10250033 
Oct 25, 1995 

LANGUAGE: English RECORD TYPE: Fulltext 

WORD COUNT: 716 LINE COUNT: 00058 

... doors from the Internet community to those portions of the service. 

In Gangi ' s view, the place where accesses come from will not be as 
important in the future as it is today. 

"The number of subscribers is largely irrelevant. What is apparent 
from the extraordinary growth curves is that most Americans will have 
Internet access by the end of the decade. This is not and shouldn't be 
looked at as traditional mass media. We are talking about how to take this 
interactive media and create affinity groups for them," he said. 

In the short-to medium-term, however, all the online services are 
worried about "churn." The word refers to the phenomenon... 
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Paradigm Shift, (new era of information technology) (DMR Group Inc. 

executives Don Tapscott and Art Caston write new book, Paradigm Shift) 
(includes excerpts from publication) (Interview) 

Colanna, Jerry 

Information Week, n394, p34(5) 
Oct 5, 1992 

DOCUMENT TYPE: Interview ISSN: 8750-6874 LANGUAGE: ENGLISH 

RECORD TYPE: ABSTRACT 

...ABSTRACT: enterprise. The authors argue that there is a connection 
between economic, organizational, political and technological change. In 
addition, the book suggests the importance of a value network via the 
creation of the extended enterprise that involves customers , suppliers, 
as well as affinity groups . 
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Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 4 34 

... spending patterns and offer products appealing to those behaviors. 

Most of those surveyed agree. "What you'll see coming up in the future 
is pseudo affinity card products , creating products around groups 
or communities of interest," says another. For instance, rather than 
partnering directly with a supermarket, issuers may instead opt to offer 
value cards, providing manufacturer discounts to loyal customers . "Every 
time participating merchants swipe the card, the issuer will capture some 
information, sell back to merchants, and on the third visit the consumer 
gets . . . 
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... s strength is its interface. Once consumers are in the door, Amazon 

has what appears to be a defensible position in functionality. Online 
reviews by readers , affinity groups , and preference-matching 
technologies can be duplicated, but require a large installed base, current 
and historical, to be valuable . Amazon has already serviced one million 
unique customers , and has developed considerable expertise and experience 
in the field. Most importantly, the company's superb customer service and 
sense of community bring users back. . . 
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Word Count: 821 

... designed to support and extend community for its users. PlanetAll 1 s 

Virtual Address Book automatically updates members' address books when 
friends/contacts change personal information. Users can also join 
multiple affinity groups that are categorized according to interest. 
PlanetAll' s Crossing Paths feature keeps travel plans and itineraries 
online and automatically notifies members if friends/contacts will be 
visiting a designated geographic region. The Friends of Friends service 
allows members to make contacts through established friends and 
acquaintances. Email reminders assure that members do not lose track of 
important dates and events and PlanetAll also provides a daily email 



4 update that delivers news, sports, business, horoscopes and friends and 
contact information. 

"We are committed. . . 
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Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 1413 

... in the PlanetAll site. Outtahere, available immediately to 

PlanetAll members, provides travel planning and reservation capabilities, 
making travel planning as easy as clicking a button. 

Groups , Group Messages , Group Levels and Details. PlanetAll 
members can create or join affinity groups important to them, from 
high school and college alma maters to company affiliations and other 
special interests. The new interface now lets members view the entire 
history of messages posted within the group . Members will be able to 
create and join subsections of larger groups --letting members direct 
messages to a smaller constituency when appropriate. 

Planning, Scheduling and Reminders. Via PlanetAll 's Crossing Paths 
feature, members can keep their travel schedules online and can... 
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Language: English Record Type: Fulltext 
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. . . they begin to be sophisticated consumers of merchandise across all 

product categories; any entertainment property that aims to be a 
blockbuster must incorporate this age group among its target customers . 

Television and home video are still important entertainment 
vehicles, although the emphasis changes from edutainment toward pure 
entertainment. Books increase in difficulty from picture books to 
storybooks and easy readers. New product... 
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four-pronged attack" that includes wireless, long-distance, local 
wireline, and business-to-business high-speed access. In residential 
markets, however, TNV expects to see value propositions based on 



discounted telephone service for subscribers to TCI's cable or 
satellite-based entertainment services. That brings 13 million subscribers 
into the targeted affinity marketing group . 

Where f s the Value ? It is not accurate to characterize voice 
messaging as the core value proposition for these bundled services. The 
power gravitates toward the company that provides . . . 
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... doors from the Internet community to those portions of the service. 

In Gangi ' s view, the place where accesses come from will not be as 
important in the future as it is today. 

"The number of subscribers is largely irrelevant. What is apparent 
from the extraordinary growth curves is that most Americans will have 
Internet access by the end of the decade. This is not and shouldn't be 
looked at as traditional mass media. We are talking about how to take this 
interactive media and create affinity groups for them, " he said. 

In the short-to medium-term, however, all the online services are 
worried about "churn." The word refers to the phenomenon... 
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... s first year of operation. 

'The first few will be Internet-related technology companies for the 
obvious reason that they have a natural base of affinity groups - Web 
users who we think will be significant buyers of their equity through 
our company, ' says Klein. 

Klein hopes that the promise of such investors also will convince 
conventional investment banks to give... 
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they can make a customer a lot of money." 
Some common yet powerful data mining applications spanning multiple 



* industries include: 

* market basket analysis or product affinity analysis (What 
products or services are most frequently purchased as a group ? What are 
the v on ad 1 products which drive the highest value affinity sales?) . 

* customer retention/vulnerability (What are the factors or 
characteristics which predict that a customer is on the verge of 
cancellation?) . 

* customer acquisition life cycle... 
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... fiction, classic movies and sports, can't match NMN ' s demographics. 

Coach travelers fall into 'three homogeneous markets: under 24, adults 
and seniors, ' Kirchner said. Importantly , those markets are rarely mixed 
on the coaches, since many people travel in affinity groups . 

NMN can also help marketers through the NMN Television Network. Lisa 
Wilkinson, formerly head of marketing for Host Marriott, which operates 
restaurants at many of. . . 
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American Banker, pl4 
Nov 6, 1992 

Language: English Record Type: Abstract 
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ABSTRACT: 

...addition to the credit cards already issued to 370,000 customers. 
Primerica had, until now, solicited customers by issuing gold cards through 
Primerica Bank to affinity group members . The card will now be 
marketed to policyholders of Primerica Life. MasterCard is positioned as 
the card for value -conscious consumers . The Primerica Financial 
Services MasterCard offers a 1% rebate on charged purchases, with a 
14.95%/yr rate for the most creditworthy policyholders. The firm... 
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Document Type: Magazine/ Journal ; Trade 

ABSTRACT : 

...international product, accident and health coverages, have grown at 
double-digit rates for the past six years. Aon ' s niche strategy is focused 
on offering value -added products and services that have low 
underwriting risk or that generate fee income. Aon is an insurance and 



financial services company with four major operating segments: 1) accident 
& health insurance; 2) insurance brokerage; 3) life insurance; and 4) 
specialty property/casualty. The company markets supplemental indemnity 
accident and health products to individuals , businesses, affinity- 
groups and associations in North America, Europe and the Pacific. 
Tables in report: Stock Price & Earnings Data 1991-93; Quarterly Earnings 
Per Share 1989-93; Annual... 
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...except that a large number of people can browse through the messages 
left there. Bulletin boards are often dedicated to one purpose, such as an 
affinity group . Messages of interest and importance to group 

members can be posted here. The bulletin board can be maintained by one 
designated person, who is to make the bulletin board interactive. That is, 
people . . . 
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... of these techniques. Short showed a picture of a waterlily, which 

he used as a reference image, and asked his system to retrieve eight 
similar objects . The eighth likeness was a bunch of bananas, and one 
could actually detect the seemingly absurd relationship. However, one could 
limit the search to flowers and come up with a more relevant set . 

Concept-based retrieval of images would be the next revolution of 
image retrieval and Short predicted it was 3-5 years off. (Brenner's law 
says to double every prediction... 
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... designed to support and extend community for its users. PlanetAll 1 s 

Virtual Address Book automatically updates members' address books when 
friends/contacts change personal information. Users can also join 
multiple affinity groups that are categorized according to interest. 
PlanetAll ' s Crossing Paths feature keeps travel plans and itineraries 
online and automatically notifies members if friends/contacts will be 



visiting a designated geographic region. The Friends of Friends service 
allows members to make contacts through established friends and 
acquaintances. Email reminders assure that members do not lose track of 
important dates and events and PlanetAll also provides a daily email 
update that delivers news, sports, business, horoscopes and friends and 
contact information. 

"We are committed. . . 
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... in the PlanetAll site. Outtahere, 

available immediately to PlanetAll members, provides travel planning 
and reservation capabilities, making travel planning as easy as 
clicking a button. 

Groups , Group Messages , Group Levels and Details. PlanetAll 

members 

can create or join affinity groups important to them, from high 

school 

and college alma maters to company affiliations and other special 
interests. The new interface now lets members view the entire... 
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... is an obvious advantage. Any list that is used by multiple vendors 

for the same product or is heavily saturated with frequent mailings can 
lose value quickly. An affinity group that endorses similar products 

from competing insurers greatly diminishes the value of their 
endorsement . 

Product Characteristics 

Because auto and homeowners insurance are commodity products, price 
is the dominant characteristic in any decision to switch insurers. This... 
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TEXT: 

MBNA Corp. understands the importance of segmenting by profession 
and affinity group to get loyal customers . 
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. . . management might subtly or openly encourage the formation of such 

groups. White males usually do not have a group of their own. 

Mission of advocacy/ affinity groups . The group members are 
likely to focus on the issues most relevant to their own membership, for 
example, glass ceiling issues for females, perceived disparities in 
application of programs to members of the individuals 1 racial or ethnic 
group . The tone is "we must band together to provide mutual support, draw 
attention to our issues and get fair treatment." 

Example of an advocacy group. . . 
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. . . has no immediate plans for co-branding parent company Dean Witter 

Discover and Co., New York, is co-branding a MasterCard called Prime 
Option . 

Unlike affinity cards, which solicit individuals with common 
group interests, co-branded cards use a recognizable brand name or logo to 
cultivate customers . "The key is to bundle value -added services based 
on customer attributes," Valenza says. 

Credit-card marketers agree. Stephen Bartell, MasterCard 1 s 
vice-president of co-branded marketing, terms the concept... 
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Hard-headed showman. (Daily Mail Managing Director Guy Zitter) 
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... the Daily Mail's strategy. This is achieved by mounting circulation 

and building promotional activities which run before, during and after each 
event . 

"We target affinity groups and prospective new readers through 



1 high-profile shows and exhibitions - and use a relevant promotional 
device that will tie them to the newspaper after they have left the show, " 
explains Mike Halstead, joint managing director of HH&S, Associated. . . 
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Affinity groups: the missing link in employee involvement. 
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common problem of optimizing sub-systems, which can cause 
sub-optimization of the larger organization. 

3. TO IDENTIFY AND ADDRESS EDUCATION AND TRAINING NEEDS . When 
affinity group members find they need additional knowledge {such as 
training in quality management) or skills (such as how to perform a 
particular process), they are empowered to acquire the necessary resources. 
Education and training is particularly important for groups composed of 

members who have had no previous experience in continuous improvement 
(which is often the case with groups of administrative employees ) . 

4. TO BUILD TRUST AND COHESIVENESS . By spending time together, members 
get to know one another, learn how to work together as a team, and. . . 
members, this kind of exposure was not typical. Affinity groups force time 
onto people's schedules and create new kinds of opportunities for contact. 

Another significant benefit of affinity groups is that they help 
members gain an increased appreciation and understanding of the overall 
organization. Rather than narrowly focusing on their own divisions or 
departments, group members said they came to understand problems and 
issues their peers, bosses, and subordinates were facing. They also better 
understood how their position fit within the... 

. . .by definition, create new problems as they solve others, but a priority 
for group members was to learn to become better problem solvers. While some 

group members questioned the value of affinity groups at first, 
each group made progress. This progress demonstrated the value of affinity 
groups to everyone at NP; we believe the same will occur at... 
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. . . Another affinity product targeting the black community was 

announced in April. Black Americans of Achievement Inc. of San Diego, 
Calif., a firm that develops marketing products for various ethnic 
groups , plans to launch a credit card bearing its name. Some of the 
profit would go into scholarship funds for black students. 

Black Americans of Achievement signed an agreement with Affinity 
Partners of . . . 
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... just the right type of target market. The Internet is filled with 

these types of "lists"; communities of people that have segmented 
themselves into certain affinity groups . But when marketing to people 
on the Internet, businesses need to have a whole new consciousness. Unlike 
the traditional, and old- fashioned approach of just cranking out obnoxious 
and often unwanted ads, today's business needs to market itself by 
participating with its users in sharing ideas, solving problems and 
adding value . The key question is, do you have anything interesting to 
say? Your presence, then, with a relevant Internet gathering of your 
customers , will be a valuable one. 

Also, when you participate with your customers on the Internet, you 
also are able to learn what some of your customers problems are, and can be 
more responsive. The point is, whether... 
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. . . continued outstanding underwriting performance, conservative 

operating strategy and strong capital position. These positive rating 
factors are derived from management's successful underwriting approach of 
targeting affinity groups , largely government employees and military 
personnel, as well as its significant competitive expense advantage 
through its direct response distribution system. Partially offsetting these 
positive rating factors is the group's modest exposure to catastrophes, as 
evidenced. . . 
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skip payments for teachers during the summer and for union workers 
during strikes, are examples of other vital benefits. 

One more point bears mentioning: the value of "denial education" 
cannot be overstated. Because not all members of an affinity group 
will qualify for the group 1 s credit card, it is important to explain the 
approval process in a newsletter. This helps reduce the incidence of... 



16/3, K/30 (Item 14 from file: 148) 

DIALOG (R) File 148: Gale Group Trade & Industry DB 



1 <c)2003 The Gale Group. All rts. reserv. 

02860875 SUPPLIER NUMBER: 04270122 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Nationwide Legal Services Inc. acquires American Legal Access Systems Inc. 

PR Newswire, NYPR7 
June 5, 1986 

LANGUAGE: ENGLISH RECORD TYPE: FULLTEXT 

WORD COUNT: 308 LINE COUNT: 00026 

... by Nationwide and Beneficial. 

Kirschner stated that ALAS will concentrate its sales efforts in the 
individual consumer and small business marketplace, while Nationwide 
continues its emphasis on sales to members of affinity groups and 
employee organizations. 

Nationwide is a marketing, sales and administrative organization for 
group and prepaid legal service plans on a national scale. Its common stock 
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. . . t like shopping in supermarkets; they do it reluctantly, 

grudgingly. " 

A way of making shopping easier to handle at the A&P Futurestore is 
the clustering of products that have "natural affinities , " such as 
produce and floral, and some service and non-service departments. It f s 
important "to make these relationships clear, " Gersin says, "so that the 
customers can see them quickly and, in essence, either pre-plan their 
trip, or take advantage of spontaneity by having virtually all of the store 
identified. . . 
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WHY FIREFLY HAS MAD AVE. BUZZING: The Internet startup takes word of mouth 
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Section Heading: Marketing: THE INTERNET 

Word Count: 1,380 *Full text available in Formats 5, 7 and 9* 

BYLINE: 

By Paul C. Judge in Boston 
TEXT: 

... ZDNet T s vast library of shareware. * x Our technology enables a brand to 
do two things: get the information you want to get to the right people , 
and build affinity groups around your brand, ' 1 says Saul Klein, 

Firefly's vice-president of marketing. 

Analysts and other marketers say Firefly is perhaps a year ahead of 
others . . . 
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Harmony and patriarchy: The cultural basis for 'paternalistic headship' 
among the overseas Chinese 

Westwood, Robert 
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...TEXT: seek to structure their organizations around personalistic 
relationships based on trust and mutuality. Naturally, family relationships 
inherently possess these qualities and heads prefer to locate family 

members in key organizational positions. Where this is not possible, they 
favour people with whom there is some type of in-built and reliable basis 
for a relationship; for example, people from the same clan, kinship, 
language/dialect or heritage group . Sometimes, people not connected in 
this way may attain the status of quasi- family members , with similar 
bonds of mutuality, by sustained support and loyalty. Beyond this level, 

affinity between the head and employees is more restricted, but still 
the quality of the relationship is important . Employees who have 
demonstrated their loyalty and respect can attain a trusted status and 
expect some consideration from the head. Full-time, permanent employees in 
key. . . 
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An examination of influences leading to Americans ' 
policy of free trade 
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...TEXT: more abstract, multiple values to more concrete beliefs toward 
policy issues as a "funnel." 

Social Categorization 

Social categorization refers to persons' tendency to place other persons 

into conceptual groupings and to differentiate their own grouping from 
other groupings of persons (Hogg and Turner 1987; Tajfel 1981a). That 

grouping into which persons categorize themselves is often termed the 
in- group (Brewer 1979; Turner 1991); persons tend to endorse their 
ingroup and more important , to deprecate out-groups (Campbell and 
McCandless 1951; Ray and Lovejoy 1986). Such deepseated feelings of 
acceptance of in- groups and their members and rejection of out- groups 

and their members become internalized as values. The in-group of 
particular interest to this study is persons ' national grouping -here, 
Americans-an in- group to which many persons have a particular affinity 

(Turner 1982, 1987; Waheeduzzaman and Marks 1989) . 

An important outcome of social categorization is the formation of 
stereotypical images of other groups and, by extension... 
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ABSTRACT: The CNA Insurance Cos.' new payroll deduct personal lines program 
offers any of the coverage segments in CNA's universal security policy 
(USP) portfolio to employees of corporations or members of affinity 
groups . CNA is already a significant marketer of life-health products 
via payroll deduction. The company sees payroll deduction as an efficient 
way to deliver its popular USP package policy as well. USP originally was 
designed. . . 

...TEXT: Insurance Companies' new payroll deduct personal lines program. 
The program offers any of the coverage segments in CNA's universal security 
policy (USP) portfolio to employees of corporations or members of 
affinity groups . 

CNA is already a significant marketer of life health products via 
payroll deduction. The company sees payroll deduction as an efficient way 
to deliver its popular USP package policy as well. 

USP originally was designed. . . 
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A comparative analysis of the affinity card market in the USA and the UK 
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...TEXT: to gauge affinity card-related awareness, knowledge and attitude. 
To achieve the integrative approach suggested necessary in examining 
affinity card partnerships, an additional four focus group discussions 
were held with members of the general public in the USA and the UK. The 
objective was to explore the general awareness of affinity cards, their 
perceived benefit, relevant financial incentives, structures and the 
importance of various design layouts . The participants were encouraged 
to focus on all participants in the affinity card programme, including 
banks, charities and cardholders. 

Findings 

The primary research revealed the key motivations behind entering into and 
satisfaction with affinity card programmes. In... 
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Feds terminate rules that limited software purchases 
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...TEXT: behind the report had been determine how the government might 
bring about convergence between the two primary competing open standards 
suites — OSI and TCP/IP. 

AFFINITY GROUPS 

But members of the group , which was headed by Department of Defense 
Telecommunications Director Diane Fountaine, went further afield. 



' Frustrated by interoperability gaps in OSI products and lured by the 
growing influence of TCP/IP, the federal network managers concluded that 
the government should ditch mandatory standards. 

Instead, the report suggests federal agencies could work toward 
interoperability. . . 
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Characteristics and customs: Empirical evidence on the union- joining 
decision 

Ingham, Mike 
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...TEXT: is in accord with the traditional expectation. As for the presence 
of dependent children, the evidence once more rejects the full hypothesis 
that unionization and family responsibilities are independent with 
employees having dependent children being significantly more likely to be 
union members . 

Tastes for unionization are normally held to be influenced by educational 
attainment and occupational status. The usual line of argument is that the 
more highly educated are likely to feel themselves better able to. . . 
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Affinity marketing: What is it and how does it work? 
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...TEXT: a specially tailored series of inducements appealing to the 
specific needs of the group member. An. understanding of the lifestyle, 
consumption patterns, and interests of group members is essential to 
this task. For example, the AAA offers members discounts on car rental, 
lodging, auto repair, and insurance. The enhancement package is most 
significant for the nominal affinity group because members of such 
groups have minimal social bonding and commitment to a group mission; 
their predominant motivation for membership is "What's in it for me?". 

AFFINITY MARKETING VERSUS... 
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Third Party Endorsements : Are They Worth the Effort? 
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ABSTRACT: Endorsements, or testimonials, are significant forces for 
selling a wide range of products . Endorsements are especially effective 
in selling insurance. This is because most insurance policies are bought on 
trust in the honesty of the underwriting company and. . . 



... 8 major factors can help in evaluating possible third party endorsers. 
' The 8 factors include: 1. the size of the group, 2. the degree of 

affinity that exists among group members , 3. the strength of the 
endorsement, 4. the availability of demographic data, 5. the availability 
of established communications within the group, 6. whether the individual 
must pay to belong to the group, 7. the mail-order buying history of 
members of the group , and 8. the condition of the names on the list. 
Material for a third-party marketing effort should address itself directly 
to the market being solicited, rather than to the broad market. 
Establishing an insurance representative to whom group members can 
appeal for assistance or advice can be helpful. Also, companies should 
consider a specific relaxation of underwriting requirements when they are 
making group offers. 
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Perceptual Awareness of Energy Requirements of Familiar Objects 
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ABSTRACT: Three experiments were conducted to measure undergraduates' 
perceptual awareness of energy requirements in their immediate environment. 
In Experiment 1, subjects sorted 61 energy items into groups based on 
general likeness and on likeness in terms of energy consumed in an hour 
of continuous use. Cluster analysis indicated that items were grouped 

according to function and size but not by energy requirements. In 
Experiment 2, subjects rank ordered 19 household appliances by energy 
consumption and size. The judged order of energy consumption matched the 
actual order moderately well, but important deviations from accuracy was 
apparent for items whose perceived volume seemed to have an influence 

on perceived energy consumption. Some large objects were seen to 
require high energy even though their actual consumption is relatively low. 
Similarly, small objects were believed to use small amounts of energy. . . 
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Beyond Laptops 

Computerworld Financial Services Journal 

Fielding a sales force with laptop computers to capture and process 
mortgages is not the most cost-effective approach, contends industry 
leader HomeSide Lending 

Byline: Mark Halper 

Journal: Computerworld Page Number: Fll 

Publication Date: October 01, 1996 
Word Count: 1188 Line Count: 113 

Text : 

... operations. Because the loan officers and their laptops worked out of 
branch banks, this shift spelled the end of the system. Further, HomeSide 
decided to emphasize secondary mortgage products such as home equity 
loans and lines of credit rather than mortgage originations. To observers 
of the mortgage industry, HomeSide' s rebuke of portable, face... 

... the gathering of property value data. And on the horizon are EDI links 
with mortgage insurance companies. "The whole idea is to not have a bunch 

of people running to the fax machine or checking Federal Express and 
using all the things we've been using,'' DeOsca said. Tweaking PMIAura, 
the underwriting software... has a four-pronged marketing initiative to 



* 25/3, K/l (Item 1 from file: 275) 

DIALOG (R) File 275: Gale Group Computer DB(TM) 
< (c) 2003 The Gale Group. All rts. reserv. 

02263932 SUPPLIER NUMBER: 19268787 {USE FORMAT 7 OR 9 FOR FULL TEXT) 

Data mining you can afford, (low-cost, simpler tools for extracting data 
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... to specific industries, vendors are rolling out versions aimed at 

vertical markets. HyperParallel, in San Francisco, offers customizable 
algorithms that can be plugged into its data mining engine to address 
specific needs. For example, a retailer can use its Affinity module to 
look for relationships between sales of different products, a process known 
as market-basket analysis. This year, IBM will add customizable 
applications, including. . . 
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TEXT: 

Last month I wrote about an important set of data mining 
activities, which included clustering, classification, predicting, and 
affinity grouping (market basket analysis). (See "Digging into Data 
Mining , " DBMS, October 1997) I hope I whetted your appetite and you are 
anxious to begin mining with one of the data mining tools you found on 
Larry Greenfield ! s Web page (pwp. starnetinc . com/larryg/index . html ) . But, 
are you ready? Does anything have to be done to your data, or can any data 
warehouse automatically be used for data mining ? The answer is that 
often a significant amount of work needs to be done to prepare your data 
for data mining . In fact you may spend more effort getting the data 
ready for data mining than you will spend actually doing the data 
mining . This month, I explore many of the data transformations you will 
need to perform. 
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. . . warehouse systems is the necessary ingredient that has made data 

mining real and actionable. 



The Categories of Data Mining 
The best way to talk about data mining is to talk about what it 
does. A useful breakdown of data mining activities includes: 
clustering, classifying, estimating and predicting, and affinity 
grouping. For the discussion of this taxonomy I am indebted to Michael 
Berry and Gordon Linoff for their wonderful new book, Data Mining 
Techniques for Marketing, Sales, and Customer Support (John Wiley & Sons, 
1997) . 

An example of clustering is looking through a large number of 
initially undifferentiated customers. . . 
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ABSTRACT: Data mining is being embraced as a panacea by many 
information-technology professionals, but the term 'mining' is often used 
to refer to wildly different concepts. True data mining involves much 
more than typical online analytical processing or traditional SQL queries. 
It demands an elaborate, automated analysis of detailed operational data 
from customer transactions in order to identify trends, patterns and 
interrelationships and predict future behavior. Robust data - mining 
environments use analytic and mathematical modeling and often employ such 
techniques as neural network technology, statistical analysis, induction, 
time sequencing and clustering. Mathematical algorithms are... 

...to detailed operational data and refined over successive iterations 
until the program achieves the highest possible accuracy. Key mining 
applications include market basket analysis, product affinity analysis, 
customer retention, customer acquisition, price optimization, risk 
management, segmentation and target marketing. 



25/3 ,K/5 (Item 1 from file: 621) 

DIALOG (R) File 621:Gale Group New Prod.Annou. (R) 
(c) 2003 The Gale Group. All rts. reserv. 

01495607 Supplier Number: 47156368 (USE FORMAT 7 FOR FULLTEXT) 
Red Brick Demonstrates Data Warehouse Performance Leadership in Retail and 
Consumer Packaged Goods 

PR Newswire, p0225NYTU007 
Feb 25, 1997 

Language: English Record Type: Fulltext 
Document Type: Newswire; Trade 
Word Count: 1133 

. . . pharmaceutical sales to manage. Currently, Longs has a 

sophisticated sales tracking and analysis system running on Red Brick 
Warehouse and is developing category management and data mining systems 
to build its customer knowledge and affinity programs. 

"Longs' successful fact-based merchandising strategy is built on the 
technology provided by our Red Brick-based data warehouse, " said Brian 
Kilcourse, CIO at... 
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... s customers and prospects." 

Tandem's Director of Decision Support Solutions, Rich Ghiossi, feels 
the partnership will be beneficial as well. "Tandem's Object Relational 
Data Mining technology and the broad portfolio of algorithms supported 
by NeoVista's Decision Series suite enables companies, like retailers, to 
find product affinities and improve profitability," said Ghiossi. 

The NeoVista Decision Series 

The NeoVista Decision Series synthesizes pattern discovery and 
recognition tools, relational databases and widely-accepted interface... 
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. . . just specify a department and be automatically presented with 

multi-item affinities. 

About HyperParallel, Inc. 

Based in San Francisco, HyperParallel is a leading supplier of data 
mining algorithms to the retail, insurance, financial and 

telecommunications industries. HyperParallel 1 s algorithms include Affinity 
for market basket analysis, Sequence for customer retention, Induction for 
target marketing and Cluster for market segmentation. 

For more information, contact HyperParallel at 282 Second. . . 
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. . . the risk of losing a customer; classifying customers with seemingly 

unrelated characteristics into market segments-of-one; predicting future 
demand by discovering product and service affinities , and, subsequently, 
providing bundled offerings; and managing marketing campaigns. It 
incorporates IBM's powerful data mining algorithms. 

DecisionEdge for Utilities is the second in a series of 
industry-specific solutions introduced by IBM. It follows DecisionEdge for 
Telecommunications, an offering designed. . . 
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TEXT: 

TOOLS TAILORED TO DIFFERENT FUNCTIONS, INDUSTRIES: IBM GRABBED A commanding 
lead in the sparsely populated data mining market last week with the 
announcement of tools and services to help customers find specific kinds of 
warehoused data, analyze it and apply it in ways never thought of before, 
analysts said. Analysts across the board praised IBM's entry into data 
mining , a market served mainly by small start-up companies offering niche 
products. IBM, they said, has taken a coherent functional approach that 
makes the advantages of data mining clear to customers. In fact, the 
IBM toolkit and services could spark heightened interest in mining data 
warehouses for information once thought unavailable to company 
decision-makers, analysts said. The new products and services include the 
Intelligent Miner, a toolkit for analyzing... 

...throughout an enterprise. The company also announced several customized, 
cross-industry applications, as well as consulting and services support to 
help customers take advantage of knowledge discovery and validation 
techniques. "Business intelligence is not just about building a data 
warehouse -- it's about detecting something that you didn't know before," 
said. . . 

...and predictive modeling, to association discovery, sequential pattern 
discovery, and database segmentation. The Intelligent Miner also includes a 
pre-processing library of tools to prepare data for mining and 
verification. The tools can be invoked dynamically, without coding, during 
the iterative process of preparing, mining, and verification. These tools 
include data selection, transformation. . . 

...propensity to purchase and consumer vulnerability analysis campaigns. 
Item set analysis aims to understand customer buying, behavior and to 
predict their future behavior by identifying affinities among their 
choice of products and services. Fraud detection identifies deviations from 
established usage norms to flag suspicious transactions which may be 
indicative of fraudulent activity. IBM said it would offer its consulting 
and services expertise to help customers design, integrate, and test data 

mining solutions for a wide range of industries, including retail, 
banking, financial services, health care, travel, telecommunications, and 
insurance. HOW DATA MINING WORKS IN THE REAL WORLD: * DEVIATION 
DETECTION COULD BE USED BY A FINANCIAL SERVICES company to detect 
fraudulent use of credit cards by examining deviations... 

...common patterns of symptoms that lead to particular illnesses. Source: 
IBM LOOKING AT UNLOOKED-AT HYPOTHESES Analysts across the board praised 
IBM's entry into data mining . "IBM is taking a strong position in the 
data mining world, " said Robert Moran, director for decision support and 
research with the Aberdeen Group (Boston, Mass.). "Most companies use 
decision support tools as a way to analyze data . Data mining combs 
through data to find unlooked-at hypotheses. This sets the stage for 
decision support tools to look at data with a fresh perspective." "Our 
studies show a. . . 

...other markets," said Aaron Zornes, executive vice president of 
applications development strategies for the META Group (Stamford, Conn.). 
"During 1996, we believe that IBM's data mining toolkit and related 
service offerings will set the pace for large-scale business technology 
initiatives in data warehousing." IBM's offering in this area is... 
...first step," said Brian Murphy, senior analyst with the Yankee Group 



1 (Boston, Mass.). There are many small companies trickling into the 
business, he said, selling data mining products to companies with data 
warehouse software. "But these are basically generic data mining 
tools." "...targeted for later this year. Both Moran and Murphy predicted 
IBM will announce a version for HP-UX and/or Sun Solaris in the future. 
Data mining technology will be available to third party VARs, system 
integrators, and ISVs for outsourcing research projects and tailored 
business intelligence application development. IDS will begin... 
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. . . HBO has tested and favors a UNIX-based Data General Aviion 

platform, for one. Others remain unnamed. 

CompuCare's entry in the UNIX race is Affinity , also running on 
Aviion, a RISC architecture. Compatibility with IBM's RISC 6000 hardware is 
expected in four to six months. UNIX's portability allows Affinity to 
convert mini - computers to Data General within four days, says 
CompuCare COO Randy Parker in Reston, Va. Software cost: $700,000-$l 
million, with peripherals, for over 500 beds. Payback: three years. Parker 
was unable to estimate FTE savings but reports that Affinity cuts 
applications/systems programmers. 

Offered as an upgrade of Sigma Series software, Affinity now offers 
PA, general accounting and O/E. Radiology's coming in... 
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DEERFIELD, IL — Walgreens has begun using Retail Discovery Suite from 
Knowledge Discovery One, Austin, TX. A program called Basketdynamics 
lets retailers look in each shopper's market basket to measure profit, 
margin, product affinities , and 167 other elements of a transaction. 
Retailers can also track inventory, vendor performance, and assortment 
profiles . 

A second program, Promotiondynamics , forecasts how well a. . . 
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second-tier credit card banks will find it hard to stay competitive 
with fast-moving giants, Mr. Burnell added. 

Mr. Saunders said Household would be mining data bases to 
increase the card business under its own brand, and is open to 
relationships with large affinity groups. 

"There are a slew of good people working here and they are all 
focused and motivated, said Mr. Saunders. "Collectively, we will have 
victories . . . 
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... to specific industries, vendors are rolling out versions aimed at 

vertical markets. HyperParallel , in San Francisco, offers customizable 
algorithms that can be plugged into its data mining engine to address 
specific needs. For example, a retailer can use its Affinity module to 
look for relationships between sales of different products, a process known 
as market-basket analysis. This year, IBM will add customizable 
applications, including. . . 
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are just 12 employees so far — specializes in digging up 
undiscovered, and often valuable, information from large data warehouses. 

HyperParallel has created a family of data - mining software modules 
that run on massively parallel processors and clustered Unix systems. 
HyperParallel * s //Discovery engines can perform such warehouse-scavenging 
tasks as detecting affinities and, among products purchased by a 
particular customer, identifying buying trends, recognizing patterns in 
actions, and segmenting records with similar characteristics. 

For example, HyperParallel ' s .. . 
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... money as necessary to retain a customer. 

Clustering is related to classification, but differs in that no groups 
have yet been defined. Using clustering, the data mining tool discovers 
different groupings within the data. This can be applied to problems as 
diverse as detecting defects in manufacturing or finding affinity groups 
for bank cards. 

All of these applications may involve predictions, such as whether a 
customer will renew a subscription. The fifth application type, forecasting 
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. . . are just 12 employees so f ar--specializes in digging up 

undiscovered, and often valuable, information from large data warehouses. 

HyperParallel has created a family of data - mining software modules 
that run on massively parallel processors and clustered Unix systems. 
HyperParallel ' s //Discovery engines can perform such warehouse-scavenging 
tasks as detecting affinities and, among products purchased by a 
particular customer, identifying buying trends, recognizing patterns in 
actions, and segmenting records with similar characteristics. 

For example, HyperParallel ' s .. . 
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mobile checkouts with handheld devices, completing warehouse 
inventories in seconds and isolating problem areas in any store nationwide. 

In-depth analysis of newly captured information ( data mining ) also 
allows retailers to manage businesses more efficiently. With Boston Market, 
for instance, Harreld found that the chain could more usefully understand 
its stores and judge performance by looking at them in " affinity groups" 
of stores with similar customer bases rather than comparing them to other 
stores in a geographic area. 



25/3, K/18 (Item 3 from file: 148) 

DIALOG{R) File 148:Gale Group Trade & Industry DB 
(c)2003 The Gale Group. All rts. reserv. 



* 08388978 SUPPLIER NUMBER: 17983666 (USE FORMAT 7 OR 9 FOR FULL TEXT) 

Mining data warehouses, (data mining tools) (includes related article about 
data mining) (Technology Information) 

Edelstein, Herb 

Inf ormationWeek, n561, p48(4) 

Jan 8, 1996 

ISSN: 8750-6874 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2156 LINE COUNT: 00170 



. . . money as necessary to retain a customer. 

Clustering is related to classification, but differs in that no groups 
have yet been defined. Using clustering, the data mining tool discovers 
different groupings within the data. This can be applied to problems as 
diverse as detecting defects in manufacturing or finding affinity groups 
for bank cards. 

All of these applications may involve predictions, such as whether a 
customer will renew a subscription. The fifth application type, forecasting 
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0 

800, labor 17.3 30.8 

900, health, welfare, urban 3.2 10.8 

N 307 65 

The estimation of (4 ! ) rests on the " affinity " variables 
representing [ [M . sub . iW . sup . 2 ] - [M . sub . iL . sup . 2 ] ] , the differences in the 
voter's matches with the characteristics of the winner... 



. . . ik . sup . 2 ] , is the squared difference between the dates of the voter's 
and the candidate's Ph.D.s.(7) Although other possible affinities come to 

mind , our data set did not permit the construction of many of them. We 
do, however, experiment below with several other possibilities. 
Statistics describing the matches are shown... 
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...TEXT: been effective in cultivating loyalty among given customer groups, 
so that only the minimum necessary expenditure is earmarked toward 
retaining any patron. 

* Clustering. Through clustering, data mining tools discover groups 
within the data. The process can be applied to such problems as finding 
defects within a sales pattern and pinpointing affinity groups for 
proprietary credit cards . 

* Forecasting. While all of the above types of information can involve 



* predictions, such as whether a customer of a store. 
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...TEXT: 1995, and which sales reps sold the most? 

BIS tools are user-driven: a query is defined and launched. By contrast, 
another type of software, data mining tools, employs outer-limits 

techniques such as neural nets, decision trees, and "smart agents" to 
automatically search raw data for significant patterns and relationships. 

Data mining has obvious marketing applications—expert s like to cite 

the case of the supermarket chain that discovered an unexpected affinity 
between diapers and beer but it has a variety of other potential uses, such 
as detecting fraud or product defects. 

All Hype Aside 
Any company. . . 
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... to specific industries, vendors are rolling out versions aimed at 

vertical markets. HyperParallel , in San Francisco, offers customizable 
algorithms that can be plugged into its data mining engine to address 
specific needs. For example, a retailer can use its Affinity module to 
look for relationships between sales of different products, a process 
known as market-basket analysis. This year, IBM will add customizable 
applications, including... 
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are just 12 employees so far-specializes in digging up 



"undiscovered, and often valuable, information from large data warehouses. 

HyperParallel has created a family of data - mining software 
modules that run on massively parallel processors and clustered Unix 
systems. HyperParallel ' s //Discovery engines can perform such warehouse- 
scavenging tasks as detecting affinities and, among products purchased 
by a particular customer, identifying buying trends, recognizing patterns 
in actions, and segmenting records with similar characteristics. 

For example, HyperParallel ' s .. . 
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money as necessary to retain a customer. 

Clustering is related to classification, but differs in that no 
groups have yet been defined. Using clustering, the data mining tool 
discovers different groupings within the data. This can be applied to 
problems as diverse as detecting defects in manufacturing or finding 
affinity groups for bank cards. 

All of these applications may involve predictions, such as whether a 
customer will renew a subscription. The fifth application type, 
forecasting . . . 
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. . . constitutes an asset class, and how that definition applies to real 

world investment alternatives. It shows effective portfolio managers must 
expand their framework beyond 1 capital assets . 1 

An asset class is a set of assets that bear some fundamental 
economic similarities to each other, and have characteristics that make 
them distinct from other assets that are not part of that class. It is not 
sufficient that values of a group of assets simply have a low 
historical correlation with the values of another group of assets . 
If that were the case, then a collection of stocks with very low (or 
negative) betas would be considered an asset class separate from those... 
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. . . turn requires a definition of what constitutes an asset class, a 

definition that applies to real-world investment alternatives. 
DEFINITION 

An asset class is a set of assets that bear some fundamental 
economic similarities to each other, and that have characteristics that 
make them distinct from other assets that are not part of that class. It is 
not sufficient that values of a group of assets simply have a low 
historical correlation with the values of another group of assets . 
If that were the case, a collection of stocks with very low betas might be 
considered an asset class separate from those stocks that make... 
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... to others in a group, is a multifaceted phenomenon. Shaw (1981: 

213) , for instance, used the term cohesiveness to refer to "the degree to 
which members of the group are attracted to each other" and measured it 
using the stated attraction to the group , the general morale of group 
members , and the degree to which members coordinated their efforts. 
Similarly, Katz and Kahn (1978) argued that the integration of a social 
system results from a number of causes but is most directly a function of 



affective factors rather than role requirements. This suggests that within 
a group , individuals 1 personal satisfaction with other members and 
motivation to sustain those relationships are important indications of 
integration. Social integration, therefore, can best be thought of as a 
multifaceted phenomenon that reflects attraction to the group , 
satisfaction with other members of the group , and social interaction 
among the group members (Katz and Kahn, 1978) . It is well established 
that some measures of social integration may be affected by the relative 
similarity of group members (cf . Festinger, 1954; Newcomb, 1961) . 
Similarity in attitudes, for example, has been shown to promote group 
cohesion significantly (Good and Nelson, 1971) . Terborg, Castore, and 
DeNinno (1976) showed in a longitudinal... 
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. . . TEXT : completely . 

Seloff (1990) has described such a system at the NASA Johnson Space 
Centre f s (JSC) film repository in Houston. Designed to manage an extensive 
collection of space- related images , it allows a more traditional 
textual approach as well as a prototype visual thesaurus approach. When 
considering the textual approach, however, it should be pointed. . . 

... is an adaptation of Syracuse University's SIRE system (Noreault et al . , 
1977), uses a number of statistically-based search algorithms which rank 
the retrieved images in order of their similarity to the query. As a 
result, there is an increased probability that the user will find images 

relevant to his or her needs displayed toward the beginning of the set of 
hits, thereby avoiding a potentially time-consuming search through the rest 
of the retrieved images. This is in sharp contrast to normal, Boolean-based 
systems, where relevant images will be scattered throughout a set of 
hits which are presented in a somewhat meaningless sequence, such as 
physical storage order. 

Meanwhile, the JSC system's prototype visual thesaurus is constructed... 
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...TEXT: important as the member's demographic similarity to his or her 
peers within the formally constituted work unit. 

Our perspective also suggests that differentiation among members strictly 
on the basis similarity may result in unintended negative consequences 
for the leader. Diversity in a work group could benefit the leader greatly 
as diverse members are more likely than similar members to interact with 
a nonredundant set of individuals within and, perhaps, outside the 
organization (Milliken & Martins, 1996) . In this way, diverse members would 
be in a position to bring unique resources to the leader, especially when 



the individual characteristics of leader and member differ. This 
possibility suggests an important practical implication: diverse members 
may be more instrumental to the leader in developing nonredundant ties 
with key individuals in the organization than are similar members who 
tend to develop relationships with individuals already in the leaders 
network . 

Leaders and Mentors During Socialization 

A third implication of our propositions involves the respective roles of 
leaders and mentors... 



28/3, K/6 (Item 3 from file: 15) 

DIALOG (R) File 15 : ABI/Inf orm ( R) 

(c) 2003 ProQuest Inf o&Learning . All rts. reserv. 
01092484 97-41878 

A network analysis of charismatic leadership: The case of a police 
department 
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1995 
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...TEXT: of charisma than task proximity. An explanation of these results, 
however, might be advanced suggesting that such findings are instead an 
artifact of pre-existing similarities among group members . Social 
psychological research on attitudinal similarity and liking indicates 
that similarly-minded individuals tend to like each other and also tend to 
see things similarly. That is, people who have similar attitudes, 
backgrounds, and values tend to have more similar views (Byrne, 1971), 
and they will tend to enter into friendship groups with one another. The 
suggestion here would be . . . 

...This would explain the significant degree of agreement between proximate 
others in the friendship network. Moreover, the fact that centrality in the 
friendship networks appears related to individuals 1 deviation from the 
organization average on attributions of transactional leadership support 
this alternative explanation. 

The same argument cannot be made to account for the structural... 
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Document Clustering, Using Macro Retrieval Techniques 

Kochtanek, Thomas R. 

Journal of the American Society for Information Science v34n5 PP: 356-359 

Sep 1983 
ISSN: 0002-8231 JRNL CODE: ASI 

. . .ABSTRACT: annotated bibliography on computerized axial tomography 
literature. Results of a later online search to confirm Dwyer's 
bibliography shows that a high inclusion rate of articles in Dwyer f s 

collection revealed in the later search does not guarantee an acceptable 
access rate. A macro retrieval technique can eliminate this problem. Such a 
technique generates a large, loosely structured set of documents by 
combining references and citations to illustrate subject similarities 

among documents . The macro retrieval algorithm provides a set of 

pertinent documents based upon a known relevant entry document . There 
are 3 steps to the reference and citation approach: 1. Select a relevant 

entry document. 2. Generate a file of potentially related documents 
linked to the entry document. 3. Evaluate the file with respect to an ideal 



file. A test of the first run of the macro retrieval algorithm using 
Dwyer's bibliography showed improved recall value over conventional online 
search techniques. Further iterations improved results even more. A method 
to cluster documents generated by macro retrieval could improve 

searching precision. . . . 
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...ABSTRACT: made, and 6. the outlet, or where the purchase takes place. A 
number of marketing techniques have been developed to identify the segments 
or similar groups of customers within a market. The first step in this 
identification process is to determine the reasons for different customer 
preferences. These factors can be isolated through... 

. . . class and life-cycle classifications, with data drawn from US Census 
Bureau statistics. This technique can illuminate relationships within a 
market segment. For example, business customers tend to exhibit certain 
similarities . Important factors determining their behavior are: 1. the 
size of the business account, 2. the primary service vehicle, 3. the 
location of business customers in relation to markets, and 4. the 
nature of the business. ... 
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members a reason and incentive -- to get on the Internet, 
providing a productive and entertaining Internet experience. 

EarthLink f s strategic Affinity marketing partnerships include Best 
Buy Co., Columbia TriStar Pay Television, CompUSA, CyberMedia, Fox 
Interactive, Hard Rock. . . 
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stores, mass merchants and catalogs, but he also has his eye on the 
future of " affinity marketing ": the Internet and World Wide Web. As a 
longtime online user himself, Gurlitz foresees a time when. . . 
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Abstract (Basic) : EP 229232 A 

The system operates by storing data files in a computer accessed 
data storage device and relating pairs of the data files according to 
user -defined values of user -defined relationship attribute 
parameters. The content of each of the data files is characterised 
according to user -defined values of user -defined file attribute 
parameters and records of the values of the relationship and file 
attribute parameters are stored. 

Groups of the data files may then be identified according to the 
stored relationship and file attribute parameter values. Provisions may 
be provided for transmitting commands to a computer operating system 
when selected files are accessed or modified to invoke execution of 
user created programs. 

ADVANTAGE - Enables user to establish new file attribute whenever 
the need arises and the user is not limited to selecting from among 
fixed number of predefined attributes, 

1/5 

Title Terms: DATA; FILE; MANAGEMENT; SYSTEM ; COMPUTER; DATA; STORAGE; 
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Abstract (Basic) : EP 1308870 A2 

NOVELTY - A set of web pages accessed by a user is detected and 
compared to the web page access pattern of known users to give a 
comparison result. The set of web pages is assigned a combined bias 
value according to bias values assigned to each web page and an 
expectation maximization (EM) process is performed on the set of web 
pages using data from an EM process performed on a training set of 
users . 

DETAILED DESCRIPTION -.The comparison result achieved by mapping 
the set of web pages to a multi-dimensional vector, bias value and 
EM process result are used to assign (140) profile attributes to the 
user which are combined to produce demographic user profile data. 
An INDEPENDENT CLAIM is also included for stored software. 
USE - For determining demographic profile data of Internet users 
for market research purposes. 

ADVANTAGE - Demographic user profile data can be extrapolated 
from analysis of web usage patterns. 

DESCRIPTION OF DRAWING (S) - The drawing shows a flowchart of a 
process for determining user profile attributes through a vector 
comparison . 

Assign profile attribute (140) 
pp; 30 DwgNo 3/16 
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Customized ranking and comparison system for objects such as 
consumer products to enable user comparison , has personalized 
decision making apparatus with stored attribute data to enable user 
to create visible output identifying selection 

Patent Assignee: QUALITY INT SOFTWARE & SERVICES LTD (QUAL-N) 

Inventor: TALYARKHAN R 
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Abstract (Basic) : GB 2379291 A 

NOVELTY - A personalized decision making apparatus to enable a 
user/selector to create a visible output identifying a selection which 
matches the users needs from amongst a number of objects, comprising: 
(a) means for storing data relating to a number of objects that relates 
to a number of attributes that are composed of at least one 
sub-attribute; (b) input means to enable the user to specify a score 
for each sub-attribute of an object group from which the user wishes 
to choose; 

DETAILED DESCRIPTION - (c) input means to enable the user to 
specify a personalized relative relevance of each attribute fro the 
group ; (d) means for generating group object ranking in accordance 
with the specified inputs and the stored data; (e) means fro generating 
a visible diagram in which each object is represented by a specific 
color and in which an area in the diagram representing each attribute 
is scaled by the input relative relevance of that attribute and in 
which within the area representing the attribute the object ranking is 
shown; and (f) means for generating audible or visible text explaining 
the reasons for the object ranking based on the specified inputs and 
the stored data, which text is presented in response to interaction by 
the user with the visible diagram, and the scope of which text is 
limited to only that deemed significant to the user in response to 
calculations based on the inputs fro the user. 

USE - Ranking and comparison of objects e.g. consumer products such 
as cars, washing machines, cameras, where there is a wide choice of 
similar objects/products to choose from that have a large amount of 
features, specifications or attributes that require detailed comparison 
to establish through personal customized preferences which 
obj ect /product is the most appropriate/relevant for the user/selector. 

ADVANTAGE - Enables object/ product comparisons to be both 
simplified and personalized by matching the attributes of the 
objects/ products , the rating information available for each 
object's attributes and the perceived reliability of the information 
with the relative relevance of the different attributes to the 
user/selector. Allows user/selectors to make full use of the 
information and reduces the complexity of the comparison process , 
where ranking and identifying the best obj ect /product under a 
specific set of circumstances is required. 

DESCRIPTION OF DRAWING (S) - The drawings show illustrations of one 
and three attributes respectively. 
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Abstract (Basic) : WO 200161601 Al 

NOVELTY - A characteristic value for each user among multiple 
users and a set of relationships between the users (121-127) 
are maintained. A community ratings (231-237) for one particular user 
is derived, by performing a function on characteristic valve of all 
user related to that particular user . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) Computer-readable medium storing computer executable 
instructions ; 

(b) Computer system for determining community rating for one 
particular user 

USE - For electronic environments like online trading environment, 
online shopping site, online auctioning site, online person-to-person 
trading site, online gaming site, etc. 

ADVANTAGE - Enhances on-line trading experience for both buyers and 
sellers , thereby increasing community registrations and pool of 
potential trading partners. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
community feedback rating determining system. 
Users (121-127) 

Community ratings (231-237) 
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Abstract (Basic) : WO 200111487 A2 

NOVELTY - A drug comparator comprises a product attribute 
comparator that generates product -attribute similarity score (s) of 
drug(s) and reference drug(s); and a composite score generator that 
generates composite quantitative comparison (s) based, at least in 
part, on attribute similarity score (s) comprising product - 
attribute similarity scores . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for: 

(1) a method of comparing target drug(s) with reference 
drug(s), comprising generating product - attribute similarity 
scores , and composite quantitative comparisons ; 

(2) a storage medium that contains software, that when executed on 
computing system , performs the above method ; and 

(3) a product including a composite quantitative comparison of 
target drug(s) with reference drug(s) based on attribute data 
describing attributes of each target and reference drugs, in which the 
quantitative composite comparison is generated by the above method . 

USE - The drug comparator is for measuring similarities between 
words or between representations of multiple attributes, consisting of 
drugs dosage strength, color or shape, especially those that are 
related to pharmacological products or items. 

ADVANTAGE - The invention is automatically operated thus 
eliminating the involvement of experts in analyzing facts and in making 
decisions. It provides a practical way for comparing very large numbers 
of target and reference drugs. Comparison of particular 

target-reference pair with a large population of other target-reference 
pairs is possible. Thus, the invention is time and cost saving. 

DESCRIPTION OF DRAWING (S) - Figure 1 is a functional block diagram 
of an illustrative user computer. 
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Data analysis apparatus for e.g. analyzing thoughts, perceptions or 
knowledge feelings - in which user selected elements are input, 
grouped and ranked for determining degree of characteristics having 
selected degrees of correlation 
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Abstract (Basic) : WO 9820431 A 

The data analysis apparatus includes an input for inputting or 
selecting data elements according to user commands. A single element 
and pair of elements are formed. A user inputs similar 
characteristics between the pair of elements and difference 
characteristics between the single element and pair of elements. This 
is performed for a number of iterations and element and characteristic 
combinations . 

The elements are then ranked by a user in relation to each 
characteristic and the rankings are analyzed to determine the 
correlation between elements and characteristics . The analysis may 
be expanded or refined and further elements and characteristics may 
be added at any stage . 

USE - Exploring thoughts, perceptions, knowledge and feelings of 
individual for use in e.g. education, commerce, self-analysis, 
entertainment, market research, expert systems, interviewing etc. 

ADVANTAGE - Enables user to explore thoughts, perceptions etc. 
without requiring input from professional interviewer. 
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Abstract (Basic) : US 5438676 A 

The method involves computing initial weights estimating software 
feature significance. A similarity function estimating a user 
perception of object similarity and the weights are passed to an 
estimation procedure, along with software objects and a parameter 
defining the number of objects associated with each feature . This 
gives updated similarity function parameters which are used with 
other parameters to give a set of misclassif ied or poor confidence 
Mavericks . 

The set is analysed to give an approved set indicating 
deferment, firm assignment, software object alteration by user or 
alteration of the main parameters. If the main parameters are altered 
the estimation procedure is repeated until updated output parameters of 
the coefficients are received form the estimation procedure. 

USE/ADVANTAGE - For software structure analysis, reorganisation, 
documentation and monitoring . Allows early detection of software 
structure changes . 
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Abstract (Basic) : EP 434586 A 

An attribute -based classification and retrieval system for 
group technology applications uses a codeless classification system 
. The classification structures are held in hierarchies and an 
attribute file (20) . 

The structures may be searched at any level. Relationships 
between entities and classification attributes are held in a 
parameter file (60) along with parameter values related to each 
entity - attribute pair. The results of queries on the data are stored 
in results files (80) as successive queries narrow the scope of the 
search. 

ADVANTAGE/USE - Avoids the need to preplan and predefine a coding 
and structure for the system . (17pp Dwg.No.5/9) 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide a document clustering system which can 
classify document data into a number of clusters corresponding to a 

clustering ob j ect . 

SOLUTION: A group of feature vectors of documents that a feature 
vector generating means 103 generates is decomposed by singular- values 
and document similarity vectors 108 for computing the similarities 

among the documents are generated from the result 106 of the singular- 
value decomposition. A cluster generating means 110 computes the 
distance between an object document and the center of gravity of a cluster 
by using a document similarity vector, classifies the same object 
document for the 2nd time while increasing the number of dimensions of the 
document similarity vector used for the 1st classification , and compares 
the results of the both to determine a cluster having small variation as 
a stable cluster . A data selecting means 109 excludes documents of the 
stable cluster from the object to select object documents to be 
classified next by the cluster generating means, and repeats this 
trial. The classification is repeated in steps to determine the number of 
clusters corresponding to the object even when the number of clusters 
is not determined in advance. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To provide an information sorting system capable 
of performing information sorting on which a user's taste is surely 
reflected. 

SOLUTION: Correlation values to express degrees of similarity in tastes 
between users are held in a correlation value file F4 of each user . 
And plural pieces of object information to be presented, for example, some 

attributes such as, manufacturers, functions and prices are set for 
each piece of merchandise information and the correlation value of 
the user is updated at any time based on an evaluated result for each 

attribute by the user by a correlation value managing part 13. The 
information to be presented to which the same attribute as the one which 
is relatively highly evaluated by an object person to whom the information 
is presented is sorted among pieces of the object information to be 
presented which is applicable to the tastes of other users based on a 
correlation value after update by taking the opportunity of a request from 
the object person to whom the information is presented as one of the users 
by an information managing part 12 and an information presenting part 14. 
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ABSTRACT 

PURPOSE: To provide the similarity degree table preparing device in a 
conception hierarchy , for preparing a similarity degree table between 
each attribute value is accordance with a user 1 s intention, while 

maintaining the balance and the consistency of the degree of similarity in 
the whole conception hierarchy . 

CONSTITUTION: A similarity degree temporary file 7 stores a similarity 
degree table. The constraint inputted from a constraint input part 9 is 
stored in a constraint file 10. When a user inputs the new degree of 
similarity from a similarity degree input part 11, a similarity degree 
setting part 12 sets the new degree of similarity to the similarity 
degree table. The user selects a propagation state from a propagation state 
input part 16. A similarity degree propagating part 13 propagates the new 
degree of propagation to other degree of similarity stored in the 
similarity degree table, based on the constraint. A matching adjusting part 
14 adjusts each degree of similarity of the similarity degree table, based 
on the constraint. 
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Publication Year: 1996 

CODEN: MALEEZ ISSN: 0885-6125 

Language: English 

Document Type: JA; (Journal Article) Treatment: T; (Theoretical) 
Journal Announcement: 9607W1 

Abstract: This paper presents a new form of exemplar-based learning, 
based on a representation scheme called feature partitioning, and a 
particular implementation of this technique called CFP (for Classification 
by Feature Partitioning) . Learning in CFP is accomplished by storing the 
objects separately in each feature dimension as disjoint sets of values 
called segments. A segment is expanded through generalization or 
specialized by dividing it into sub-segments. Classification is based on a 
weighted voting among the individual predictions of the features , 
which are simply the class values of the segments corresponding to the 
values of a test instance for each feature . An empirical evaluation of 
CFP and comparison with two other classification techniques that consider 
each feature separately are given. (Author abstract) Refs. 

Descriptors: Learning systems ; Pattern recognition; Feature extraction; 
Knowledge representation 

Identifiers: Incremental learning; Exemplar based learning; 
Classification by feature partitioning (CFP) 
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Publication Year: 1985 

CODEN: AISJB6 ISSN: 0002-8231 

Language: ENGLISH 

Document Type: JA; (Journal Article) Treatment: A; (Applications); T; 
(Theoretical) 

Journal Announcement: 8511 

Abstract: A new model of viewing a document based on the citing-cited 
relationship between documents is introduced. Using Bayes 1 decision 
theory, it is shown how a source document may be indexed and weighted 
by its set of relevant cited or citing document features , 
corresponding to a one pass relevance feedback Model 1 (probabilistic 
indexing ) or Model 2 (probabilistic retrieval) system. Once every document 
in a collection has been so indexed , various forms of similarity measures 
based on probability of topical relevance between documents are 
derivable, including asymmetric, symmetric, and the relationship with Model 
3. Applications to retrieval and document clustering are also discussed. 
(Edited author abstract) 24 refs. 
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Ability to measure value from the customer 1 s point of view is 
central to the determination of market offerings: Customers will only 
buy the equivalent of perceived value , and companies can only offer 
benefits that cost less to provide than customers are willing to pay. 
Conjoint analysis is the most popular individual -level value 
measurement method to determine relative impact of product or service 
attributes on preferences and other dependent variables. 

This research focuses on how value measurement can be made more 
accurate and more reliable by measuring the relative influence of selected 
methodological variations on performance in prediction and on stability 
of value structure, and by grouping customers with similar value 
structure into segments which respond to product stimuli in a similar 
manner. Influences of the type of attributes included in the conjoint 
task, of the factorial design used to construct the product profiles, of 
the type and form of model, of the time of measurement, and of the type of 
cluster -based segmentation method , are evaluated. 

Data was gathered with a questionnaire that controlled for 
methodological variations, and with a notebook computer as the measurement 
object. One repeated measurement was taken. 

The study was conducted in two phases. In Phase I, influences of 
methodological variations on accuracy in prediction and on respective 
value structure were examined. In Phase II, different cluster -based 
segmentation methods -- hierarchical clustering (HIC), non- 
hierarchical clustering (NHC) , and fuzzy c-means clustering (FUC) — and 
according conjoint models were evaluated for their performance in 
prediction and in comparison with individual -level conjoint models. 
Results show the best models for a variety of design parameters are 
traditional individual -level, main-effects-only conjoint models. Neither 
modeling of interactions, nor segment-level conjoint models were able to 
improve on prediction . Best segment-level conjoint models were obtained 
with a fuzzy clustering method , worst models were obtained with k-means 
and the most fuzzy clustering approach. 

In conclusion, conjoint analysis reveals itself as a reliable method 
to measure individual customer value . It seems more rewarding for 
improvement of accuracy in prediction to apply repeated measures, or 
gather additional data about the respondent, than to attempt improvement on 
methodological variations with a single measurement. 
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Consumer choice models do not consider vagueness, the inherent 
uncertainty in the definition of a concept, despite its prevalence 
throughout the consumer choice environment in product descriptions (price 
is high) and consumer ratings (product is good) . Fuzzy sets have been 
shown to be a suitable representation for vague information, with desirable 
theoretical and measurement properties. This dissertation will test two 
fuzzy set models against a multiattribute conjoint model using identical 
information. Fuzzy set models of choice are hypothesized to improve 
prediction in accordance with the representation of vagueness in a strong 
test of prediction using a comparable prediction methodology . 
Advanced self-administered conjoint software is developed to elicit fuzzy 
sets , which can also adjust product attributes , numeric prototype 
values and vagueness according to individual subject preferences. 

The fuzzy conjoint model is a fuzzified vector weight conjoint 
model that tests only the fuzzy set representation of vagueness. 
Variations in size of the fuzzy sets demonstrate that the representation 
alone is responsible for the large prediction improvements. Prediction 
results for the fuzzy conjoint model are 155 percent better overall than 
the crisp conjoint model, with a first choice prediction rate of 82 
percent for 6 balanced boldout products. Prediction improvements are 
robust across product categories, 192 subjects, 6 studies, alternative 
conjoint models and the omission of estimation information. 

The second fuzzy model performs approximate reasoning from an 
individual linguistic fuzzy rule base using estimation responses. This 
first application of fuzzy production rules to ordinary consumers in a 
computerized interaction leads to the most meaningful model explaining 
prediction . Vagueness factors are highly significant in explaining 
prediction improvements, with the overall explained variance more than 
twice as high as alternative models. Both fuzzy set models significantly 
improve prediction using identical data in an ordinary choice situation. 
The results improve substantially as the representation of vagueness is 
added. This application of fuzzy sets to individual level models 
demonstrates the value of making the maximal use of minimally but 
sufficient ordinal preferences for products and attributes. There is 
considerable potential for both further research and successful 
applications of these models. 
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Marketing concepts applicable to the diffusion of six productivity 
programs by the Texas Hospital Association to hospitals throughout the 
State are presented. The programs in question include a basic productivity 
(best methods ) program and programs in management engineering, management 
reporting, health manpower resource allocation, shared collections, and 
group purchasing. Background marketing concepts with particular 
relevance to the diffusion of innovations are discussed. A problem 
structure for relating program attributes to market characteristics 
is presented, and the relationships between program attributes and the 
rate of diffusion are explored. Sixteen attributes are compared for 
the six productivity programs: initial cost, continuing cost, ratio of 
cost recovery, payoff, perceived relative advantage, risk and uncertainty, 
complexity, clarity of results, compatibility, pervasiveness, divisibility, 
social cost, social approval, scientific status, interpersonal 
relationships, and gatekeepers. Specific marketing strategies are then 
presented for three of the productivity programs — management engineering, 
management reporting, and best methods . The strategies are based on the 
examination of program attributes and the discussion of marketing and 
diffusion concepts. The basic process through which variables 
characterizing hospitals can be used to segment the market for the 
Association's productivity programs and to begin formulation of an 
overall marketing strategy is presented in a market segmentation 
worksheet . 

Descriptors: Texas; Management methods ; Interactions; Health planning 
agencies; Health care delivery organizations; Financial management 

Identifiers: Hospitals; ^Productivity; HRP/BL; HRP/UCA; HRP/ZC; HRP/UD; 
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administration; NTISHRANHP 

Section Headings: 44B (Health Care — Agency Administrative and Financial 
Management ) 
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... of the SQL NULL in these tightly coupled fields. 

The second difference was in the relationship between entities , 
attributes , and values . I had based my model on the idea that entities 
have attributes that can take... 

...V. This makes the query "What things are red?" very easy to answer: Find 
all sets with a color attribute, then find all elements of the set 
which have a value of "red" on that attribute. 

Simple, Files Aren't 

Another reader.. 
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Comparability and comparison levels used in choices among consumer 
products . 

Corf man, Kim P. 

Journal of Marketing Research, v28, n3, p368(7) 
August, 1991 

ISSN: 0022-24 37 LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 4 987 LINE COUNT: 00421 

favorite research topic, but few studies have addressed the issue 
of choice among products with sets of descriptive attributes that do not 
completely overlap, and even fewer have examined how the... 

...Lusch, and Stafford 1979; Pickering 1981). Research on means-end chain 
models has examined the relationships among concrete product 
attributes , abstract attributes , consequences, and values within 
consumers 1 cognitive structures for a product class (Gutman 1982; Howard 
1977; Olson and Reynolds... 
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... of the similarity level between machines. 

We have drawn upon these concepts to develop our similarity values 
between product characteristics (see e.g., Section 4.2). Since our 
interest is not in classifying parts into families (or groups ), the 
clustering techniques following the definition of similarity are not 
beneficial. While clustering techniques could be helpful in identifying 
clusters of similar parts that satisfy a degree of closeness, we have 
instead adopted a ranking. . . 
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. . .ABSTRACT: is important to analyze the data from consumers and to apply 
the quantitative analyses to marketing research . A procedure for 
measuring the preference weights of consumers for attribute levels of 
prototype products is proposed using paired comparison with scores . 
An information-theoretic procedure for measuring the consumer preference 
for each prototype product and the importance or utility of... 
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TEXT: 

...Special features and comments 

-Searches multimedia files (image, sound, Java, etc.). 
-Advanced search turns off relevance ranking . 

-New "Live Topics" feature compares concepts across documents 
-Duplicates are a problem. 
-Concept searching . 
-"More like this. " 
-Duplicates are a problem. 

-Searches multimedia files (image, sound, Java, etc.). 
-Duplicate pages are grouped . 

-Searches multimedia files (image, sound, Java, etc.). 
-Search by fields such as title, keyword. 
Can. . . 



